(19) Europäisches Patentamt
European Patent Office
Office européen des brevets

(11) EP 0 998 569 B1

(12) EUROPEAN PATENT SPECIFICATION

(45) Date of publication and mention
of the grant of the patent:
01.03.2006 Bulletin 2006/09

(21) Application number: 98943469.1

(22) Date of filing: 31.08.1998

(51) Int Cl.:
C12N 15/56
C12N 1/21
C12N 15/11
C12N 15/63
C12N 9/24
A61K 38/47

(86) International application number:
PCT/US1998/017954

(87) International publication number:
WO 1999/011798 (11.03.1999 Gazette 1999/10)



(54) POLYNUCLEOTIDE ENCODING A POLYPEPTIDE HAVING HEPARANASE ACTIVITY AND 
EXPRESSION OF SAME IN TRANSDUCED CELLS 

POLYNUKLEOTID KODIEREND FÜR EIN POLYPEPTID MIT HEPARANASE-AKTIVITÄT UND
DESSEN EXPRESSION IN TRANSDUZIERTEN ZELLEN

POLYNUCLEOTIDE CODANT POUR UN POLYPEPTIDE PRESENTANT UNE ACTIVITE 
D'HEPARANASE ET SON EXPRESSION PAR DES CELLULES TRANSDUCTEES 






(84) Designated Contracting States: 

AT BE CH DE DK ES FI FR GB GR IE IT LI NL PT

(30) Priority: 02.09.1997 US 922170 

02.07.1998 US 109386 

(43) Date of publication of application: 
10.05.2000 Bulletin 2000/19 

(60) Divisional application: 
04001481.3 / 1 439 193
04001482.1 / 1 439 226
04018465.7 / 1 489 183



(73) Proprietors: 

• Insight Biopharmaceuticals Ltd
76121 Rehovot (IL)

• HADASIT MEDICAL RESEARCH SERVICES AND DEVELOPMENT LTD.
91120 Jerusalem (IL)

(72) Inventors: 

• PECKER, Iris

75204 Rishon Le Zion (IL) 

• VLODAVSKY, Israel 
90805 Mevaseret Zion (IL) 

• FEINSTEIN, Elena 
76214 Rehovot (IL) 



(74) Representative: Wachenfeld, Joachim 
Vossius & Partner 
Postfach 86 07 67 
81634 München (DE)



(56) References cited: 
WO-A1-95/04158
US-A-5 362 641



• JIN L ET AL: "MOLECULAR CLONING AND
EXPRESSION OF HUMAN HEPARANASE CDNA"
PROCEEDINGS AMERICAN ASSOCIATION FOR
CANCER RESEARCH ANNUAL MEETING 1992,
vol. 33, 1992, page 57, XP002139971 ISSN:
0197-016X

• FREEMAN CRAIG ET AL: "Human platelet
heparanase: Purification, characterization and
catalytic activity." BIOCHEMICAL JOURNAL
MARCH 15, 1998, vol. 330, no. 3, 15 March 1998
(1998-03-15), pages 1341-1350, XP002139978
ISSN: 0264-6021

• DATABASE GENBANK ON STN, US NATIONAL
LIBRARY OF MEDICINE, (Bethesda, MD, USA),
No. N32056, HILLIER et al., "The WashU-Merck
EST Project", 10 January 1996, XP002915282



Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give 
notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in 
a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art. 
99(1) European Patent Convention). 



Printed by Jouve, 75001 PARIS (FR)






• DATABASE GENBANK ON STN, US NATIONAL
LIBRARY OF MEDICINE, (Bethesda, MD, USA),
No. N30845, HILLIER et al., "The WashU-Merck
EST Project", 05 January 1996, XP002915283

• DATABASE GENBANK ON STN, US NATIONAL
LIBRARY OF MEDICINE, (Bethesda, MD, USA),
No. N30824, HILLIER et al., "The WashU-Merck
EST Project", 05 January 1996, XP002915284

• DATABASE GENBANK ON STN, US NATIONAL
LIBRARY OF MEDICINE, (Bethesda, MD, USA),
No. AA304653, ADAMS et al., "Initial Assessment
of Human Gene Diversity and Expression
Patterns Based Upon 83 Million Nucleotides of
cDNA Sequence", 18 April 1997, XP002915285






Description 

FIELD AND BACKGROUND OF THE INVENTION 

[0001] The present invention relates to a polynucleotide, referred to hereinbelow as hpa, encoding a polypeptide
having heparanase activity, vectors including same and transduced cells expressing heparanase. The invention further
relates to a recombinant protein having heparanase activity.

[0002] Heparan sulfate proteoglycans: Heparan sulfate proteoglycans (HSPG) are ubiquitous macromolecules
associated with the cell surface and extracellular matrix (ECM) of a wide range of cells of vertebrate and invertebrate
tissues (1-4). The basic HSPG structure includes a protein core to which several linear heparan sulfate chains are
covalently attached. These polysaccharide chains are typically composed of repeating hexuronic and D-glucosamine
disaccharide units that are substituted to a varying extent with N- and O-linked sulfate moieties and N-linked acetyl
groups (1-4). Studies on the involvement of ECM molecules in cell attachment, growth and differentiation revealed a
central role of HSPG in embryonic morphogenesis, angiogenesis, neurite outgrowth and tissue repair (1-5). HSPG are
prominent components of blood vessels (3). In large blood vessels they are concentrated mostly in the intima and inner
media, whereas in capillaries they are found mainly in the subendothelial basement membrane where they support
proliferating and migrating endothelial cells and stabilize the structure of the capillary wall. The ability of HSPG to interact
with ECM macromolecules such as collagen, laminin and fibronectin, and with different attachment sites on plasma
membranes suggests a key role for this proteoglycan in the self-assembly and insolubility of ECM components, as well
as in cell adhesion and locomotion. Cleavage of the heparan sulfate (HS) chains may therefore result in degradation of
the subendothelial ECM and hence may play a decisive role in extravasation of blood-borne cells. HS catabolism is
observed in inflammation, wound repair, diabetes, and cancer metastasis, suggesting that enzymes which degrade HS
play important roles in pathologic processes. Heparanase activity has been described in activated immune system cells
and highly metastatic cancer cells (6-8), but research has been handicapped by the lack of biologic tools to explore
potential causative roles of heparanase in disease conditions.

[0003] Involvement of Heparanase in Tumor Cell Invasion and Metastasis: Circulating tumor cells arrested in
the capillary beds of different organs must invade the endothelial cell lining and degrade its underlying basement
membrane (BM) in order to invade into the extravascular tissue(s) where they establish metastasis (9, 10). Metastatic tumor
cells often attach at or near the intercellular junctions between adjacent endothelial cells. Such attachment of the
metastatic cells is followed by rupture of the junctions, retraction of the endothelial cell borders and migration through the
breach in the endothelium toward the exposed underlying BM (9). Once located between endothelial cells and the BM,
the invading cells must degrade the subendothelial glycoproteins and proteoglycans of the BM in order to migrate out
of the vascular compartment. Several cellular enzymes (e.g., collagenase IV, plasminogen activator, cathepsin B,
elastase, etc.) are thought to be involved in degradation of BM (10). Among these enzymes is an endo-β-D-glucuronidase
(heparanase) that cleaves HS at specific intrachain sites (6, 8, 11). Expression of a HS degrading heparanase was found
to correlate with the metastatic potential of mouse lymphoma (11), fibrosarcoma and melanoma (8) cells. Moreover,
elevated levels of heparanase were detected in sera from metastatic tumor bearing animals and melanoma patients (8)
and in tumor biopsies of cancer patients (12).

[0004] The control of cell proliferation and tumor progression by the local microenvironment, focusing on the interaction
of cells with the extracellular matrix (ECM) produced by cultured corneal and vascular endothelial cells, was investigated
previously by the present inventors. This cultured ECM closely resembles the subendothelium in vivo in its morphological
appearance and molecular composition. It contains collagens (mostly type III and IV, with smaller amounts of types I
and V), proteoglycans (mostly heparan sulfate- and dermatan sulfate-proteoglycans, with smaller amounts of chondroitin
sulfate proteoglycans), laminin, fibronectin, entactin and elastin (13, 14). The ability of cells to degrade HS in the cultured
ECM was studied by allowing cells to interact with a metabolically sulfate labeled ECM, followed by gel filtration (Sepharose
6B) analysis of degradation products released into the culture medium (11). While intact HSPG are eluted next to the
void volume of the column (Kav < 0.2, Mr ~ 0.5 × 10^6), labeled degradation fragments of HS side chains are eluted more
toward the Vt of the column (0.5 < Kav < 0.8, Mr = 5-7 × 10^3) (11).
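
The elution criteria quoted above can be illustrated with a short sketch. It assumes the conventional gel filtration definition of the partition coefficient, Kav = (Ve - V0) / (Vt - V0), where Ve is the elution volume of the peak, V0 the void volume and Vt the total column volume. The function names and the example volumes are hypothetical and are not taken from the specification; only the Kav thresholds come from the text.

```python
def kav(elution_volume, void_volume, total_volume):
    """Partition coefficient Kav = (Ve - V0) / (Vt - V0) for a gel filtration peak."""
    if total_volume <= void_volume:
        raise ValueError("total column volume must exceed the void volume")
    return (elution_volume - void_volume) / (total_volume - void_volume)


def classify_peak(k):
    """Assign a peak to the ranges quoted in the text (thresholds from paragraph [0004])."""
    if k < 0.2:
        return "intact HSPG"
    if 0.5 < k < 0.8:
        return "HS degradation fragments"
    return "unassigned"


# Hypothetical column: V0 = 10 ml, Vt = 30 ml (illustrative values only).
for ve in (11.0, 22.0):
    k = kav(ve, 10.0, 30.0)
    print(f"Ve = {ve} ml, Kav = {k:.2f}: {classify_peak(k)}")
```

Expressing each peak as Kav rather than as a raw elution volume makes runs on columns of different size comparable, which is why the text reports the two populations as Kav ranges.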

[0005] The heparanase inhibitory effect of various non-anticoagulant species of heparin that might be of potential use
in preventing extravasation of blood-borne cells was also investigated by the present inventors. Inhibition of heparanase
was best achieved by heparin species containing 16 sugar units or more and having sulfate groups at both the N and
O positions. While O-desulfation abolished the heparanase inhibiting effect of heparin, O-sulfated, N-acetylated heparin
retained a high inhibitory activity, provided that the N-substituted molecules had a molecular size of about 4,000 daltons
or more (7). Treatment of experimental animals with heparanase inhibitors (e.g., non-anticoagulant species of heparin)
markedly reduced (>90%) the incidence of lung metastases induced by B16 melanoma, Lewis lung carcinoma and
mammary adenocarcinoma cells (7, 8, 16). Heparin fractions with high and low affinity to anti-thrombin III exhibited a
comparable high anti-metastatic activity, indicating that the heparanase inhibiting activity of heparin, rather than its
anticoagulant activity, plays a role in the anti-metastatic properties of the polysaccharide (7).
[0006] Heparanase activity in the urine of cancer patients: In an attempt to further elucidate the involvement of
heparanase in tumor progression and its relevance to human cancer, urine samples were screened for heparanase activity
(16a). Heparanase activity was detected in the urine of some, but not all, cancer patients. High levels of heparanase
activity were determined in the urine of patients with an aggressive metastatic disease and there was no detectable
activity in the urine of healthy donors.

[0007] Heparanase activity was also found in the urine of 20% of normal and microalbuminuric insulin dependent 
diabetes mellitus (IDDM) patients, most likely due to diabetic nephropathy, the most important single disorder leading 
to renal failure in adults. 

[0008] Possible involvement of heparanase in tumor angiogenesis: Fibroblast growth factors are a family of
structurally related polypeptides characterized by high affinity to heparin (17). They are highly mitogenic for vascular
endothelial cells and are among the most potent inducers of neovascularization (17, 18). Basic fibroblast growth factor
(bFGF) has been extracted from the subendothelial ECM produced in vitro (19) and from basement membranes of the
cornea (20), suggesting that ECM may serve as a reservoir for bFGF. Immunohistochemical staining revealed the
localization of bFGF in basement membranes of diverse tissues and blood vessels (21). Despite the ubiquitous presence
of bFGF in normal tissues, endothelial cell proliferation in these tissues is usually very low, suggesting that bFGF is
somehow sequestered from its site of action. Studies on the interaction of bFGF with ECM revealed that bFGF binds to
HSPG in the ECM and can be released in an active form by HS degrading enzymes (15, 20, 22). It was demonstrated
that heparanase activity expressed by platelets, mast cells, neutrophils, and lymphoma cells is involved in release of
active bFGF from ECM and basement membranes (23), suggesting that heparanase activity may not only function in
cell migration and invasion, but may also elicit an indirect neovascular response. These results suggest that the ECM
HSPG provides a natural storage depot for bFGF and possibly other heparin-binding growth promoting factors (24, 25).
Displacement of bFGF from its storage within basement membranes and ECM may therefore provide a novel mechanism
for induction of neovascularization in normal and pathological situations.

[0009] Recent studies indicate that heparin and HS are involved in binding of bFGF to high affinity cell surface receptors
and in bFGF cell signaling (26, 27). Moreover, the size of HS required for optimal effect was similar to that of HS fragments
released by heparanase (28). Similar results were obtained with vascular endothelial cell growth factor (VEGF) (29),
suggesting the operation of a dual receptor mechanism involving HS in cell interaction with heparin-binding growth
factors. It is therefore proposed that restriction of endothelial cell growth factors in ECM prevents their systemic action
on the vascular endothelium, thus maintaining a very low rate of endothelial cell turnover and vessel growth. On the
other hand, release of bFGF from storage in ECM as a complex with HS fragment may elicit localized endothelial cell
proliferation and neovascularization in processes such as wound healing, inflammation and tumor development (24, 25).
[0010] Expression of heparanase by cells of the immune system: Heparanase activity correlates with the ability
of activated cells of the immune system to leave the circulation and elicit both inflammatory and autoimmune responses.
Interaction of platelets, granulocytes, T and B lymphocytes, macrophages and mast cells with the subendothelial ECM
is associated with degradation of HS by a specific heparanase activity (6). The enzyme is released from intracellular
compartments (e.g., lysosomes, specific granules, etc.) in response to various activation signals (e.g., thrombin, calcium
ionophore, immune complexes, antigens, mitogens, etc.), suggesting its regulated involvement in inflammation and
cellular immunity.

Some of the observations regarding the heparanase enzyme were reviewed in reference No. 6 and are listed
hereinbelow:



[0011] First, a proteolytic activity (plasminogen activator) and heparanase participate synergistically in sequential 
degradation of the ECM HSPG by inflammatory leukocytes and malignant cells. 

[0012] Second, a large proportion of the platelet heparanase exists in a latent form, probably as a complex with
chondroitin sulfate. The latent enzyme is activated by tumor cell-derived factor(s) and may then facilitate cell invasion
through the vascular endothelium in the process of tumor metastasis.

[0013] Third, release of the platelet heparanase from α-granules is induced by a strong stimulant (i.e., thrombin), but
not in response to platelet activation on ECM.

[0014] Fourth, the neutrophil heparanase is preferentially and readily released in response to a threshold activation 
and upon incubation of the cells on ECM. 

[0015] Fifth, contact of neutrophils with ECM inhibited release of noxious enzymes (proteases, lysozyme) and oxygen
radicals, but not of enzymes (heparanase, gelatinase) which may enable diapedesis. This protective role of the
subendothelial ECM was observed when the cells were stimulated with soluble factors but not with phagocytosable stimulants.
[0016] Sixth, intracellular heparanase is secreted within minutes after exposure of T cell lines to specific antigens. 
[0017] Seventh, mitogens (Con A, LPS) induce synthesis and secretion of heparanase by normal T and B lymphocytes 
maintained in vitro. T lymphocyte heparanase is also induced by immunization with antigen in vivo. 
[0018] Eighth, heparanase activity is expressed by pre-B lymphomas and B-lymphomas, but not by plasmacytomas
and resting normal B lymphocytes.

[0019] Ninth, heparanase activity is expressed by activated macrophages during incubation with ECM, but there was
little or no release of the enzyme into the incubation medium. Similar results were obtained with human myeloid leukemia
cells induced to differentiate to mature macrophages.
[0020] Tenth, T-cell mediated delayed type hypersensitivity and experimental autoimmunity are suppressed by low
doses of heparanase inhibiting non-anticoagulant species of heparin (30).

[0021] Eleventh, heparanase activity expressed by platelets, neutrophils and metastatic tumor cells releases active 
bFGF from ECM and basement membranes. Release of bFGF from storage in ECM may elicit a localized neovascular 
response in processes such as wound healing, inflammation and tumor development. 
[0022] Twelfth, among the breakdown products of the ECM generated by heparanase is a tri-sulfated disaccharide
that can inhibit T-cell mediated inflammation in vivo (31). This inhibition was associated with an inhibitory effect of the
disaccharide on the production of biologically active TNFα by activated T cells in vitro (31).

[0023] Other potential therapeutic applications: Apart from its involvement in tumor cell metastasis, inflammation
and autoimmunity, mammalian heparanase may be applied to modulate: bioavailability of heparin-binding growth factors
(15); cellular responses to heparin-binding growth factors (e.g., bFGF, VEGF) and cytokines (IL-8) (31a, 29); cell
interaction with plasma lipoproteins (32); cellular susceptibility to certain viral and some bacterial and protozoa infections
(33, 33a, 33b); and disintegration of amyloid plaques (34). Heparanase may thus prove useful for conditions such as
wound healing, angiogenesis, restenosis, atherosclerosis, inflammation, neurodegenerative diseases and viral
infections. Mammalian heparanase can be used to neutralize plasma heparin, as a potential replacement of protamine.
Anti-heparanase antibodies may be applied for immunodetection and diagnosis of micrometastases, autoimmune lesions
and renal failure in biopsy specimens, plasma samples, and body fluids. Common use in basic research is expected.
[0024] The identification of the hpa gene encoding for heparanase enzyme will enable the production of a recombinant
enzyme in heterologous expression systems. Availability of the recombinant protein will pave the way for solving the
protein structure function relationship and will provide a tool for developing new inhibitors.

[0025] Viral Infection: The presence of heparan sulfate on cell surfaces has been shown to be the principal
requirement for the binding of Herpes Simplex (33) and Dengue (33a) viruses to cells and for subsequent infection of the cells.
Removal of the cell surface heparan sulfate by heparanase may therefore abolish virus infection. In fact, treatment of
cells with bacterial heparitinase (degrading heparan sulfate) or heparinase (degrading heparin) reduced the binding of
two related animal herpes viruses to cells and rendered the cells at least partially resistant to virus infection (33). There
are some indications that the cell surface heparan sulfate is also involved in HIV infection (33b).

[0026] Neurodegenerative diseases: Heparan sulfate proteoglycans were identified in the prion protein amyloid
plaques of Gerstmann-Sträussler Syndrome, Creutzfeldt-Jakob disease and Scrapie (34). Heparanase may disintegrate
these amyloid plaques which are also thought to play a role in the pathogenesis of Alzheimer's disease.
[0027] Restenosis and Atherosclerosis: Proliferation of arterial smooth muscle cells (SMCs) in response to
endothelial injury and accumulation of cholesterol rich lipoproteins are basic events in the pathogenesis of atherosclerosis
and restenosis (35). Apart from its involvement in SMC proliferation (i.e., low affinity receptors for heparin-binding growth
factors), HS is also involved in lipoprotein binding, retention and uptake (36). It was demonstrated that HSPG and
lipoprotein lipase participate in a novel catabolic pathway that may allow substantial cellular and interstitial accumulation
of cholesterol rich lipoproteins (32). The latter pathway is expected to be highly atherogenic by promoting accumulation
of apoB and apoE rich lipoproteins (i.e., LDL, VLDL, chylomicrons), independent of feedback inhibition by the cellular
sterol content. Removal of SMC HS by heparanase is therefore expected to inhibit both SMC proliferation and lipid
accumulation and thus may halt the progression of restenosis and atherosclerosis.

[0028] There is thus a widely recognized need for, and it would be highly advantageous to have, a polynucleotide
encoding a polypeptide having heparanase activity, vectors including same, transduced cells expressing heparanase
and a recombinant protein having heparanase activity.

SUMMARY OF THE INVENTION 

[0029] According to the present invention there is provided a polynucleotide, referred to hereinbelow as hpa, hpa
cDNA or hpa gene, encoding a polypeptide having heparanase activity, vectors including same, transduced cells
expressing heparanase and a recombinant protein having heparanase activity.

[0030] Cloning of the human hpa gene which encodes heparanase, and expression of recombinant heparanase by 
transfected host cells is reported. 

[0031] A purified preparation of heparanase isolated from human hepatoma cells was subjected to tryptic digestion
and microsequencing. The YGPDVGQPR (SEQ ID NO:8) sequence revealed was used to screen EST databases for
homology to the corresponding back translated DNA sequence. Two closely related EST sequences were identified and
were thereafter found to be identical. Both clones contained an insert of 1020 bp which included an open reading frame
of 973 bp followed by 27 bp of 3' untranslated region and a poly(A) tail. A translation start site was not identified.
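
To make the back translation step concrete, the sketch below enumerates the DNA sequences that could encode the tryptic peptide YGPDVGQPR under the standard genetic code. It is only an illustration of why a peptide maps to many candidate nucleotide sequences; it is not the search procedure used for the EST screen, which the specification does not detail. The codon table is restricted to the seven amino acids present in the peptide, and the function names are hypothetical.

```python
from itertools import product
from math import prod

# Standard genetic code, limited to the residues found in YGPDVGQPR.
CODONS = {
    "Y": ["TAT", "TAC"],
    "G": ["GGT", "GGC", "GGA", "GGG"],
    "P": ["CCT", "CCC", "CCA", "CCG"],
    "D": ["GAT", "GAC"],
    "V": ["GTT", "GTC", "GTA", "GTG"],
    "Q": ["CAA", "CAG"],
    "R": ["CGT", "CGC", "CGA", "CGG", "AGA", "AGG"],
}


def count_back_translations(peptide):
    """Number of distinct DNA sequences that encode the peptide."""
    return prod(len(CODONS[aa]) for aa in peptide)


def back_translations(peptide):
    """Lazily yield every DNA sequence that encodes the peptide."""
    for codons in product(*(CODONS[aa] for aa in peptide)):
        yield "".join(codons)


peptide = "YGPDVGQPR"
print(count_back_translations(peptide))   # size of the candidate space
print(next(back_translations(peptide)))   # one candidate, 27 nt long
```

Because the candidate space grows multiplicatively with peptide length, database searches of this kind are normally run against the translated database rather than against each candidate DNA sequence in turn.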
[0032] Cloning of the missing 5' end of hpa was performed by PCR amplification of DNA from placenta Marathon
RACE™ cDNA composite using primers selected according to the EST clones sequence and the linkers of the
composite. A 900 bp PCR fragment, partially overlapping with the identified 3' encoding EST clones, was obtained. The
joined cDNA fragment (hpa), 1721 bp long (SEQ ID NO:9), contained an open reading frame which encodes a polypeptide
of 543 amino acids (SEQ ID NO:10) with a calculated molecular weight of 61,192 daltons.

[0033] Cloning of an extended 5' sequence from the human SK-hep1 cell line was enabled by PCR amplification using
Marathon RACE™. The 5' extended sequence of the SK-hep1 hpa cDNA was assembled with the sequence
of the hpa cDNA isolated from human placenta (SEQ ID NO:9). The assembled sequence contained an open reading
frame, SEQ ID NOs: 13 and 15, which encodes, as shown in SEQ ID NOs: 14 and 15, a polypeptide of 592 amino acids
with a calculated molecular weight of 66,407 daltons.

[0034] The ability of the hpa gene product to catalyze degradation of heparan sulfate in an in vitro assay was examined
by expressing the entire open reading frame of hpa in insect cells, using the Baculovirus expression system. Extracts
and conditioned media of cells infected with virus containing the hpa gene demonstrated a high level of heparan sulfate
degradation activity both towards soluble ECM-derived HSPG and intact ECM. This degradation activity was inhibited
by heparin, which is another substrate of heparanase. Cells infected with a similar construct containing no hpa gene
had no such activity, nor did non-infected cells. The activity of heparanase expressed from the extended 5' clone towards
heparin was demonstrated in a mammalian expression system.

[0035] The expression pattern of hpa RNA in various tissues and cell lines was investigated using RT-PCR. It was 
found to be expressed only in tissues and cells previously known to have heparanase activity. 
[0036] A panel of monochromosomal human/CHO and human/mouse somatic cell hybrids was used to localize the
human heparanase gene to human chromosome 4. The newly isolated heparanase sequence can be used to identify
a chromosome region harboring a human heparanase gene in a chromosome spread.

[0037] According to further features in preferred embodiments of the invention described below, there is provided a
polynucleotide fragment which includes a polynucleotide sequence encoding a polypeptide having heparanase catalytic
activity.

[0038] According to still further features in the described preferred embodiments the polynucleotide fragment includes
nucleotides 63-1691 of SEQ ID NO:9 or nucleotides 139-1869 of SEQ ID NO:13, which encode the entire human
heparanase enzyme.
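
As a quick arithmetic check on the SEQ ID NO:9 coordinates quoted above, the sketch below counts the codons in a nucleotide range. It assumes the coordinates are 1-based and inclusive and that the range covers coding codons only; both are assumptions for illustration, and the helper name is hypothetical.

```python
def orf_codons(start, end):
    """Number of complete codons in a 1-based, inclusive nucleotide range."""
    length = end - start + 1
    if length <= 0:
        raise ValueError("end must not precede start")
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return length // 3


# Nucleotides 63-1691 of SEQ ID NO:9, as quoted in the text.
print(orf_codons(63, 1691))
```

Under these assumptions the range spans 1629 nucleotides, which corresponds to the 543 amino acid polypeptide reported for the placenta-derived hpa cDNA.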

[0039] According to still further features in the described preferred embodiments there is provided a polynucleotide
fragment which includes a polynucleotide sequence capable of hybridizing with hpa cDNA, especially with nucleotides
1-721 of SEQ ID NO:9.

[0040] According to still further features in the described preferred embodiments the polynucleotide sequence which 
encodes the polypeptide having heparanase activity shares at least 60 % homology, preferably at least 70 % homology, 
more preferably at least 80 % homology, most preferably at least 90 % homology with SEQ ID NOs:9 or 13. 
[0041] According to still further features in the described preferred embodiments the polynucleotide fragment according
to the present invention includes a portion (fragment) of SEQ ID NOs: 9 or 13. For example, such fragments could include
nucleotides 63-721 of SEQ ID NO:9 and/or a segment of SEQ ID NO:9 which encodes a polypeptide having the
heparanase catalytic activity.

[0042] According to still further features in the described preferred embodiments the polypeptide encoded by the
polynucleotide fragment includes an amino acid sequence as set forth in SEQ ID NOs: 10 or 14 or a functional part thereof.
[0043] According to still further features in the described preferred embodiments the polynucleotide sequence encodes 
a polypeptide having heparanase activity, which shares at least 60 % homology, preferably at least 70 % homology, 
more preferably at least 80 % homology, most preferably at least 90 % homology with SEQ ID NOs: 10 or 14. 
[0044] According to still further features in the described preferred embodiments the polynucleotide fragment encodes
a polypeptide having heparanase activity, which may therefore be an allelic, species and/or induced variant of the amino
acid sequence set forth in SEQ ID NOs: 10 or 14. It is understood that any such variant may also be considered a homolog.
[0045] According to still further features in the described preferred embodiments there is provided a single stranded 
polynucleotide fragment which includes a polynucleotide sequence complementary to at least a portion of a polynucleotide 
strand encoding a polypeptide having heparanase catalytic activity as described above. 
[0046] According to still further features in the described preferred embodiments there is provided a vector including
a polynucleotide sequence encoding a polypeptide having heparanase catalytic activity. 

[0047] The vector may be of any suitable type including but not limited to a phage, virus, plasmid, phagemid, cosmid, 
bacmid or even an artificial chromosome. The polynucleotide sequence encoding a polypeptide having heparanase 
catalytic activity may include any of the above described polynucleotide fragments. 
[0048] According to still further features in the described preferred embodiments there is provided a host cell which
includes an exogenous polynucleotide fragment including a polynucleotide sequence encoding a polypeptide having 
heparanase catalytic activity. 

[0049] The exogenous polynucleotide fragment may be any of the above described fragments. The host cell may be
of any type such as a prokaryotic cell, a eukaryotic cell, a cell line, or a cell as a portion of a multicellular organism (e.g.,
cells of a transgenic organism).

[0050] According to still further features in the described preferred embodiments there is provided a recombinant 
protein including a polypeptide having heparanase catalytic activity. 

[0051] According to still further features in the described preferred embodiments there is provided a pharmaceutical
composition comprising as an active ingredient a recombinant protein having heparanase catalytic activity.

[0052] According to still further features in the described preferred embodiments there is provided medical equipment
comprising a medical device containing, as an active ingredient, a recombinant protein having heparanase catalytic
activity.

[0053] According to still further features in the described preferred embodiments there is provided a heparanase 
overexpression system comprising a cell overexpressing heparanase catalytic activity. 

[0054] According to still further features in the described preferred embodiments there is provided a method of
identifying a chromosome region harboring a human heparanase gene in a chromosome spread comprising the steps of (a)
hybridizing the chromosome spread with a tagged polynucleotide probe encoding heparanase; (b) washing the
chromosome spread, thereby removing excess of non-hybridized probe; and (c) searching for signals associated with said
hybridized tagged polynucleotide probe, wherein detected signals are indicative of a chromosome region harboring
a human heparanase gene.

[0055] The present invention can be used to develop new drugs to inhibit tumor cell metastasis, inflammation and
autoimmunity. The identification of the hpa gene encoding for heparanase enzyme enables the production of a
recombinant enzyme in heterologous expression systems.

BRIEF DESCRIPTION OF THE DRAWINGS 

[0056] The invention is herein described, by way of example only, with reference to the accompanying drawings, wherein:

FIG. 1 presents nucleotide sequence and deduced amino acid sequence of hpa cDNA. A single nucleotide difference
at position 799 (A to T) between the EST (Expressed Sequence Tag) and the PCR amplified cDNA (reverse
transcribed RNA) and the resulting amino acid substitution (Tyr to Phe) are indicated above and below the substituted
unit, respectively. Cysteine residues and the poly adenylation consensus sequence are underlined. The asterisk
denotes the stop codon TGA.

FIG. 2 demonstrates degradation of soluble sulfate labeled HSPG substrate by lysates of High Five™ cells
infected with pFhpa2 virus. Lysates of High Five™ cells that were infected with pFhpa2 virus (•) or control pF2
virus (□) were incubated (18 h, 37 °C) with sulfate labeled ECM-derived soluble HSPG (peak I). The incubation
medium was then subjected to gel filtration on Sepharose 6B. Low molecular weight HS degradation fragments
(peak II) were produced only during incubation with the pFhpa2 infected cells, but there was no degradation of the
HSPG substrate (◊) by lysates of pF2 infected cells.

FIGs. 3a-b demonstrate degradation of soluble sulfate labeled HSPG substrate by the culture medium of pFhpa2
and pFhpa4 infected cells. Culture media of High Five™ cells infected with pFhpa2 (3a) or pFhpa4 (3b) viruses
(•), or with control viruses (□) were incubated (18 h, 37 °C) with sulfate labeled ECM-derived soluble HSPG (peak
I, ◊). The incubation media were then subjected to gel filtration on Sepharose 6B. Low molecular weight HS
degradation fragments (peak II) were produced only during incubation with the hpa gene containing viruses. There
was no degradation of the HSPG substrate by the culture medium of cells infected with control viruses.
FIG. 4 presents size fractionation of heparanase activity expressed by pFhpa2 infected cells. Culture medium of
pFhpa2 infected High Five™ cells was applied onto a 50 kDa cut-off membrane. Heparanase activity (conversion
of the peak I substrate (◊) into peak II HS degradation fragments) was found in the high (> 50 kDa) (•), but not low
(< 50 kDa) (o) molecular weight compartment.

FIGs. 5a-b demonstrate the effect of heparin on heparanase activity expressed by pFhpa2 and pFhpa4 infected
High Five™ cells. Culture media of pFhpa2 (5a) and pFhpa4 (5b) infected High Five™ cells were incubated
(18 h, 37 °C) with sulfate labeled ECM-derived soluble HSPG (peak I, ^) in the absence (•) or presence (A) of
10 μg/ml heparin. Production of low molecular weight HS degradation fragments was completely abolished in the
presence of heparin, a potent inhibitor of heparanase activity (6, 7).

FIGs. 6a-b demonstrate degradation of sulfate labeled intact ECM by virus infected High Five™ and Sf21 cells.
High Five™ (6a) and Sf21 (6b) cells were plated on sulfate labeled ECM and infected (48 h, 28 °C) with pFhpa4
(•) or control pF1 (□) viruses. Control non-infected Sf21 cells (R) were plated on the labeled ECM as well. The pH
of the cultured medium was adjusted to 6.0 - 6.2 followed by 24 h incubation at 37 °C. Sulfate labeled material
released into the incubation medium was analyzed by gel filtration on Sepharose 6B. HS degradation fragments
were produced only by cells infected with the hpa containing virus.

FIGs. 7a-b demonstrate degradation of sulfate labeled intact ECM by virus infected cells. High Five™ (7a) and
Sf21 (7b) cells were plated on sulfate labeled ECM and infected (48 h, 28 °C) with pFhpa4 (•) or control pF1 (□)
viruses. Control non-infected Sf21 cells (R) were plated on labeled ECM as well. The pH of the cultured medium was
adjusted to 6.0 - 6.2, followed by 48 h incubation at 28 °C. Sulfate labeled degradation fragments released into the
incubation medium were analyzed by gel filtration on Sepharose 6B. HS degradation fragments were produced only
by cells infected with the hpa containing virus.

FIGs. 8a-b demonstrate degradation of sulfate labeled intact ECM by the culture medium of pFhpa4 infected cells.
Culture media of High Five™ (8a) and Sf21 (8b) cells that were infected with pFhpa4 (•) or control pF1 (□)
viruses were incubated (48 h, 37 °C, pH 6.0) with intact sulfate labeled ECM. The ECM was also incubated with the
culture medium of control non-infected Sf21 cells (R). Sulfate labeled material released into the reaction mixture
was subjected to gel filtration analysis. Heparanase activity was detected only in the culture medium of pFhpa4
infected cells.

FIGs. 9a-b demonstrate the effect of heparin on heparanase activity in the culture medium of pFhpa4 infected cells.
Sulfate labeled ECM was incubated (24 h, 37 °C, pH 6.0) with culture medium of pFhpa4 infected High Five™
(9a) and Sf21 (9b) cells in the absence (•) or presence (V) of 10 μg/ml heparin. Sulfate labeled material released
into the incubation medium was subjected to gel filtration on Sepharose 6B. Heparanase activity (production of peak
II HS degradation fragments) was completely inhibited in the presence of heparin.

FIGs. 10a-b demonstrate purification of recombinant heparanase on heparin-Sepharose. Culture medium of Sf21
cells infected with pFhpa4 virus was subjected to heparin-Sepharose chromatography. Elution of fractions was
performed with 0.3 - 2 M NaCl gradient (^). Heparanase activity in the eluted fractions is demonstrated in Figure
10a (•). Fractions 15-28 were subjected to 15% SDS-polyacrylamide gel electrophoresis followed by silver nitrate
staining. A correlation is demonstrated between a major protein band (MW ~ 63,000) in fractions 19-24 and heparanase
activity.

FIGs. 11a-b demonstrate purification of recombinant heparanase on a Superdex 75 gel filtration column. Active
fractions eluted from heparin-Sepharose (Figure 10a) were pooled, concentrated and applied onto a Superdex 75
FPLC column. Fractions were collected and aliquots of each fraction were tested for heparanase activity (C, Figure
11a) and analyzed by SDS-polyacrylamide gel electrophoresis followed by silver nitrate staining (Figure 11b). A
correlation is seen between the appearance of a major protein band (MW ~ 63,000) in fractions 4-7 and heparanase
activity.

FIGs. 12a-e demonstrate expression of the hpa gene by RT-PCR with total RNA from human embryonal tissues
(12a), human extra-embryonal tissues (12b) and cell lines from different origins (12c-e). RT-PCR products using
hpa specific primers (I), primers for GAPDH housekeeping gene (II), and control reactions without reverse transcriptase
demonstrating absence of genomic DNA or other contamination in RNA samples (III). M - DNA molecular
weight marker VI (Boehringer Mannheim). For 12a: lane 1 - neutrophil cells (adult), lane 2 - muscle, lane 3 - thymus,
lane 4 - heart, lane 5 - adrenal. For 12b: lane 1 - kidney, lane 2 - placenta (8 weeks), lane 3 - placenta (11 weeks),
lanes 4-7 - mole (complete hydatidiform mole), lane 8 - cytotrophoblast cells (freshly isolated), lane 9 - cytotrophoblast
cells (1.5 h in vitro), lane 10 - cytotrophoblast cells (6 h in vitro), lane 11 - cytotrophoblast cells (18 h in vitro), lane
12 - cytotrophoblast cells (48 h in vitro). For 12c: lane 1 - JAR bladder cell line, lane 2 - NCITT testicular tumor cell
line, lane 3 - SW-480 human hepatoma cell line, lane 4 - HTR (cytotrophoblasts transformed by SV40), lane 5 -
HPTLP-1 hepatocellular carcinoma cell line, lane 6 - EJ-28 bladder carcinoma cell line. For 12d: lane 1 - SK-hep-1
human hepatoma cell line, lane 2 - DAMI human megakaryocytic cell line, lane 3 - DAMI cell line + PMA, lane 4 -
CHRF cell line + PMA, lane 5 - CHRF cell line. For 12e: lane 1 - ABAE bovine aortic endothelial cells, lane 2 - 1063
human ovarian cell line, lane 3 - human breast carcinoma MDA435 cell line, lane 4 - human breast carcinoma
MDA231 cell line.

FIG. 13 presents a comparison between nucleotide sequences of the human hpa and a mouse EST cDNA fragment
(SEQ ID NO:12) which is 80 % homologous to the 3' end (starting at nucleotide 1066 of SEQ ID NO:9) of the human
hpa. The aligned termination codons are underlined.

FIG. 14 demonstrates the chromosomal localization of the hpa gene. PCR products of DNA derived from somatic
cell hybrids and of genomic DNA of hamster, mouse and human were separated on 0.7 % agarose gel following
amplification with hpa specific primers. Lane 1 - Lambda DNA digested with BstEII, lane 2 - no DNA control, lanes
3 - 29, PCR amplification products. Lanes 3-5 - human, mouse and hamster genomic DNA, respectively. Lanes
6-29, human monochromosomal somatic cell hybrids representing chromosomes 1-22 and X and Y, respectively.
Lane 30 - Lambda DNA digested with BstEII. An amplification product of approximately 2.8 Kb is observed only in
lanes 5 and 9, representing human genomic DNA and DNA derived from cell hybrid carrying human chromosome
4, respectively. These results demonstrate that the hpa gene is localized in human chromosome 4.


DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0057] The present invention is of a polynucleotide, referred to hereinbelow interchangeably as hpa, hpa cDNA or
hpa gene, encoding a polypeptide having heparanase activity, vectors including same, transduced cells expressing
heparanase and a recombinant protein having heparanase activity.

[0058] Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is 
not limited in its application to the details of construction and the arrangement of the components set forth in the following 
description or illustrated in the drawings.

[0059] Also, it is to be understood that the phraseology and terminology employed herein is for the purpose of description 
and should not be regarded as limiting. 

[0060] The present invention can be used to develop treatments for various diseases, to develop diagnostic assays 
for these diseases and to provide new tools for basic research especially in the fields of medicine and biology. 
[0061] Specifically, the present invention can be used to develop new drugs to inhibit tumor cell metastasis, inflammation
and autoimmunity. The identification of the hpa gene encoding for the heparanase enzyme enables the production
of a recombinant enzyme in heterologous expression systems. 

[0062] Furthermore, the present invention can be used to modulate bioavailability of heparin-binding growth factors,
cellular responses to heparin-binding growth factors (e.g., bFGF, VEGF) and cytokines (IL-8), cell interaction with plasma
lipoproteins, cellular susceptibility to viral, protozoa and some bacterial infections, and disintegration of neurodegenerative
plaques. Recombinant heparanase is thus a potential treatment for wound healing, angiogenesis, restenosis,
atherosclerosis, inflammation, neurodegenerative diseases (such as, for example, Gerstmann-Sträussler Syndrome,
Creutzfeldt-Jakob disease, Scrapie and Alzheimer's disease) and certain viral and some bacterial and protozoa infections.
Recombinant heparanase can be used to neutralize plasma heparin, as a potential replacement of protamine.

[0063] As used herein, the term "modulate" includes substantially inhibiting, slowing or reversing the progression of
a disease, substantially ameliorating clinical symptoms of a disease or condition, or substantially preventing the appearance
of clinical symptoms of a disease or condition. A "modulator" therefore includes an agent which may modulate a
disease or condition. Modulation of viral, protozoa and bacterial infections includes any effect which substantially interrupts,
prevents or reduces any viral, bacterial or protozoa activity and/or stage of the virus, bacterium or protozoon life
cycle, or which reduces or prevents infection by the virus, bacterium or protozoon in a subject, such as a human or lower
animal.

[0064] As used herein, the term "wound" includes any injury to any portion of the body of a subject including, but not
limited to, acute conditions such as thermal burns, chemical burns, radiation burns, burns caused by excess exposure
to ultraviolet radiation such as sunburn, damage to bodily tissues such as the perineum as a result of labor and childbirth,
including injuries sustained during medical procedures such as episiotomies, trauma-induced injuries including cuts,
those injuries sustained in automobile and other mechanical accidents, and those caused by bullets, knives and other
weapons, and post-surgical injuries, as well as chronic conditions such as pressure sores, bedsores, conditions related
to diabetes and poor circulation, and all types of acne, etc.

[0065] Anti-heparanase antibodies, raised against the recombinant enzyme, would be useful for immunodetection 
and diagnosis of micrometastases, autoimmune lesions and renal failure in biopsy specimens, plasma samples, and
body fluids. Such antibodies may also serve as neutralizing agents for heparanase activity. 

[0066] Cloning of the human hpa gene encoding heparanase and expressing recombinant heparanase by transfected 
cells is herein reported. This is the first mammalian heparanase gene to be cloned. 

[0067] A purified preparation of heparanase isolated from human hepatoma cells was subjected to tryptic digestion
and microsequencing.

[0068] The YGPDVGQPR (SEQ ID NO:8) sequence revealed was used to screen EST databases for homology to 
the corresponding back translated DNA sequences. Two closely related EST sequences were identified and were
thereafter found to be identical. 

[0069] Both clones contained an insert of 1020 bp which includes an open reading frame of 973 bp followed by a 3' 
untranslated region of 27 bp and a Poly A tail, whereas a translation start site was not identified.

[0070] Cloning of the missing 5' end was performed by PCR amplification of DNA from placenta Marathon RACE™
cDNA composite using primers selected according to the EST clones sequence and the linkers of the composite.

[0071] A 900 bp PCR fragment, partially overlapping with the identified 3' encoding EST clones was obtained. The
joined cDNA fragment (hpa), 1721 bp long (SEQ ID NO:9), contained an open reading frame which encodes, as shown
in Figure 1 and SEQ ID NO:11, a polypeptide of 543 amino acids (SEQ ID NO:10) with a calculated molecular weight
of 61,192 daltons.

[0072] A single nucleotide difference at position 799 (A to T) between the EST clones and the PCR amplified cDNA 
was observed. This difference results in a single amino acid substitution (Tyr to Phe) (Figure 1). Furthermore, the 
published EST sequences contained an unidentified nucleotide, which following DNA sequencing of both the EST clones 
was resolved into two nucleotides (G and C at positions 1630 and 1631 in SEQ ID NO:9, respectively).
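
The position of this substitution within its codon can be checked with simple arithmetic. The following is a minimal sketch, assuming that position 799 is numbered on SEQ ID NO:9 and that the open reading frame begins at nucleotide 63, as given for the preferred embodiment in paragraph [0080]; the function names are illustrative and not part of the specification.

```python
# Sketch: locate nucleotide 799 within its codon.
# Assumptions (not confirmed by the text at this point): the position is
# numbered on SEQ ID NO:9 and the ORF starts at nucleotide 63.

TYR_CODONS = {"TAT", "TAC"}  # tyrosine codons, standard genetic code
PHE_CODONS = {"TTT", "TTC"}  # phenylalanine codons, standard genetic code


def codon_position(nt_pos, orf_start):
    """Return (codon number, position within codon), both 1-based."""
    offset = nt_pos - orf_start
    if offset < 0:
        raise ValueError("nucleotide lies upstream of the ORF start")
    return offset // 3 + 1, offset % 3 + 1


def substitute(codon, pos_in_codon, base):
    """Replace the base at a 1-based position within a codon."""
    i = pos_in_codon - 1
    return codon[:i] + base + codon[i + 1:]


codon_no, pos = codon_position(799, 63)
mutated = {substitute(c, pos, "T") for c in TYR_CODONS}
print(codon_no, pos, sorted(mutated))
```

Under these assumptions the substitution falls at the second position of its codon, where an A to T change converts either tyrosine codon into a phenylalanine codon, in agreement with the reported Tyr to Phe substitution.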

[0073] The ability of the hpa gene product to catalyze degradation of heparan sulfate in an in vitro assay was examined
by expressing the entire open reading frame in insect cells, using the Baculovirus expression system.

[0074] Extracts and conditioned media of cells infected with virus containing the hpa gene demonstrated a high level
of heparan sulfate degradation activity both towards soluble ECM-derived HSPG and intact ECM, which was inhibited
by heparin, while cells infected with a similar construct containing no hpa gene had no such activity, nor did non-infected
cells.

[0075] The expression pattern of hpa RNA in various tissues and cell lines was investigated using RT-PCR. It was 
found to be expressed only in tissues and cells previously known to have heparanase activity.

[0076] Cloning an extended 5' sequence was enabled from the human SK-hep1 cell line by PCR amplification using
Marathon RACE™. The 5' extended sequence of the SK-hep1 hpa cDNA was assembled with the sequence of
the hpa cDNA isolated from human placenta (SEQ ID NO:9). The assembled sequence contained an open reading
frame, SEQ ID NOs:13 and 15, which encodes, as shown in SEQ ID NOs:14 and 15, a polypeptide of 592 amino acids,
with a calculated molecular weight of 66,407 daltons. This open reading frame was shown to direct the expression of
catalytically active heparanase in a mammalian cell expression system. The expressed heparanase was detectable by
anti-heparanase antibodies in Western blot analysis.

[0077] A panel of monochromosomal human/CHO and human/mouse somatic cell hybrids was used to localize the 
human heparanase gene to human chromosome 4. The newly isolated heparanase sequence can therefore be used to
identify a chromosome region harboring a human heparanase gene in a chromosome spread.

[0078] Thus, according to the present invention there is provided a polynucleotide fragment (either DNA or RNA, either 
single stranded or double stranded) which includes a polynucleotide sequence encoding a polypeptide having heparanase 
catalytic activity. 

[0079] The term "heparanase catalytic activity" or its equivalent term "heparanase activity" both refer to a mammalian 
endoglycosidase hydrolyzing activity which is specific for heparan or heparan sulfate proteoglycan substrates, as opposed
to the activity of bacterial enzymes (heparinase I, II and III) which degrade heparin or heparan sulfate by means of
β-elimination (37).

[0080] In a preferred embodiment of the invention the polynucleotide fragment includes nucleotides 63-1691 of SEQ 
ID NO:9, or nucleotides 139-1869 of SEQ ID NO: 13, which encode the entire human heparanase enzyme. 
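
As a consistency check on the first of these ranges, the span length can be converted to a codon count. This is a minimal sketch under the assumption that the range covers only sense codons (no stop codon); the helper names are hypothetical.

```python
# Sketch: convert an inclusive nucleotide range into a whole-codon count.
# Assumption: the range holds only coding triplets, with no stop codon.

def span_length(first, last):
    """Length of an inclusive, 1-based nucleotide range."""
    if last < first:
        raise ValueError("range end precedes range start")
    return last - first + 1


def whole_codons(first, last):
    """Number of codons in the range; fails if it is not a multiple of three."""
    n = span_length(first, last)
    if n % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return n // 3


print(span_length(63, 1691), whole_codons(63, 1691))
```

Nucleotides 63-1691 of SEQ ID NO:9 span 1629 nucleotides, that is 543 codons, which matches the 543 amino acid polypeptide of SEQ ID NO:10 when the stop codon is not counted.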
[0081] However, the scope of the present invention is not limited to human heparanase since this is the first disclosure
of an open reading frame (ORF) encoding any mammalian heparanase. Using the hpa cDNA, parts thereof or synthetic 
oligonucleotides designed according to its sequence will enable one ordinarily skilled in the art to identify genomic and/or 
cDNA clones including homologous sequences from other mammalian species. 

[0082] The present invention is therefore further directed at a polynucleotide fragment which includes a polynucleotide
sequence capable of hybridizing (base pairing under either stringent or permissive hybridization conditions, as for example
described in Sambrook, J., Fritsch, E.F., Maniatis, T. (1989) Molecular Cloning. A Laboratory Manual. Cold Spring Harbor
Laboratory Press, New York.) with hpa cDNA, especially with nucleotides 1-721 of SEQ ID NO:9.

[0083] In fact, any polynucleotide sequence which encodes a polypeptide having heparanase activity and which shares
at least 60 % homology, preferably at least 70 % homology, more preferably at least 80 % homology, most preferably
at least 90 % homology with SEQ ID NOs:9 or 13 is within the scope of the present invention.
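
The specification does not state how percent homology is to be computed. The sketch below is one possible reading, assuming that homology is taken as percent identity over a pair of sequences that have already been aligned to equal length (gaps written as "-"); the function names and the toy sequences are illustrative only.

```python
# Sketch: percent identity over a pre-aligned sequence pair, then mapping
# to the 60/70/80/90 % tiers named in the text.
# Assumption: "homology" is read as percent identity; gapped columns are skipped.

def percent_identity(aligned_a, aligned_b):
    """Percent of identical positions among ungapped alignment columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != "-" and y != "-"
    ]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def homology_tier(pct):
    """Highest tier (90, 80, 70 or 60) reached, or None if below 60 %."""
    for tier in (90, 80, 70, 60):
        if pct >= tier:
            return tier
    return None


pct = percent_identity("ACGTACGTAC", "ACGTTCGTAC")  # toy sequences
print(pct, homology_tier(pct))
```

Other definitions (for example, counting gaps as mismatches, or scoring conservative substitutions in protein alignments) would give different values for the same pair.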

[0084] The polynucleotide fragment according to the present invention may include any part of SEQ ID NOs:9 or 13.
For example, it may include nucleotides 63-721 of SEQ ID NO:9, which is a novel sequence. However, it may include
any segment of SEQ ID NOs:9 or 13 which encodes a polypeptide having the heparanase catalytic activity.

[0085] When the phrase "encodes a polypeptide having heparanase catalytic activity" is used herein and in the claims
section below it refers to the ability of directing the synthesis of a polypeptide which, if so required for its activity, following
post translational modifications, such as but not limited to, proteolysis (e.g., removal of a signal peptide and of a pro- or
preprotein sequence), methionine modification, glycosylation, alkylation (e.g., methylation), acetylation, etc., is catalytically
active in degradation of, for example, ECM and cell surface associated HS.

[0086] In a preferred embodiment of the invention the polypeptide encoded by the polynucleotide fragment includes 
an amino acid sequence as set forth in SEQ ID NOs:10 or 14 or a functional part thereof, i.e., a portion harboring
heparanase catalytic activity. 

[0087] However, any polynucleotide fragment which encodes a polypeptide having heparanase activity is within the
scope of the present invention. Therefore, the polypeptide may be an allelic, species and/or induced variant of the amino
acid sequence set forth in SEQ ID NOs:10 or 14 or a functional part thereof.

[0088] In fact, any polynucleotide sequence which encodes a polypeptide having heparanase activity, which shares
at least 60 % homology, preferably at least 70 % homology, more preferably at least 80 % homology, most preferably
at least 90 % homology with SEQ ID NOs:10 or 14 is within the scope of the present invention.

[0089] The invention is also directed at providing a single stranded polynucleotide fragment which includes a polynucleotide
sequence complementary to at least a portion of a polynucleotide strand encoding a polypeptide having heparanase
catalytic activity as described above. The term "complementary" as used herein refers to the ability of base pairing.
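
Base pairing in this sense can be illustrated with a reverse-complement calculation. This is a minimal sketch assuming Watson-Crick pairing only (A with T or U, G with C); the toy sequences and function names are hypothetical and are not taken from the sequence listing.

```python
# Sketch: Watson-Crick complementarity between a probe and a target strand.
# Assumption: only canonical pairs are considered; RNA input (U) is accepted
# and the complement is written as DNA.

_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA or RNA sequence, as DNA."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def count_mismatches(probe, target):
    """Count positions at which the probe fails to pair with the target."""
    expected = reverse_complement(target)
    if len(probe) != len(expected):
        raise ValueError("probe and target must have equal length")
    return sum(1 for p, e in zip(probe.upper(), expected) if p != e)


print(reverse_complement("GATTACA"))
print(count_mismatches("TGTAATC", "GATTACA"))
print(count_mismatches("TGTTATC", "GATTACA"))
```

A probe with zero mismatches is fully complementary to its target; a small number of mismatches may still permit base pairing, depending on probe length and hybridization stringency.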
[0090] The single stranded polynucleotide fragment may be DNA or RNA or even include nucleotide analogs (e.g.,
thioated nucleotides). It may be a synthetic oligonucleotide or manufactured by transduced host cells, it may be of any
desired length which still provides specific base pairing (e.g., 8 or 10, preferably more, nucleotides long) and it may
include mismatches that do not hamper base pairing.

[0091] The invention is further directed at providing a vector which includes a polynucleotide sequence encoding a 
polypeptide having heparanase catalytic activity. 

[0092] The vector may be of any type. It may be a phage which infects bacteria or a virus which infects eukaryotic
cells. It may also be a plasmid, phagemid, cosmid, bacmid or an artificial chromosome. The polynucleotide sequence
encoding a polypeptide having heparanase catalytic activity may include any of the above described polynucleotide
fragments.

[0093] The invention is further directed at providing a host cell which includes an exogenous polynucleotide fragment
encoding a polypeptide having heparanase catalytic activity.

[0094] The exogenous polynucleotide fragment may be any of the above described fragments. The host cell may be
of any type. It may be a prokaryotic cell, a eukaryotic cell, a cell line, or a cell as a portion of an organism. The exogenous
polynucleotide fragment may be permanently or transiently present in the cell. In other words, transduced cells obtained
following stable or transient transfection, transformation or transduction are all within the scope of the present invention.
The term "exogenous" as used herein refers to the fact that the polynucleotide fragment is externally introduced into the
cell. Therein it may be present in a single or any number of copies, it may be integrated into one or more chromosomes
at any location or be present as an extrachromosomal material.

[0095] The invention is further directed at providing a heparanase overexpression system which includes a cell overexpressing
heparanase catalytic activity. The cell may be a host cell transiently or stably transfected or transformed with
any suitable vector which includes a polynucleotide sequence encoding a polypeptide having heparanase activity and
a suitable promoter and enhancer sequences to direct overexpression of heparanase. However, the overexpressing cell
may also be a product of an insertion (e.g., via homologous recombination) of a promoter and/or enhancer sequence
upstream to the endogenous heparanase gene of the expressing cell, which will direct overexpression from the endogenous
gene. The term "overexpression" as used herein in the specification and claims below refers to a level of expression
which is higher than a basal level of expression typically characterizing a given cell under otherwise identical conditions.

[0096] The invention is further directed at providing a recombinant protein including a polypeptide having heparanase
catalytic activity. 

[0097] The recombinant protein may be purified close to homogeneity by any conventional protein purification procedure
and/or be mixed with additives. The recombinant protein may be manufactured using any of the cells described
above. The recombinant protein may be in any form. It may be in a crystallized form, a dehydrated powder form or in
solution. The recombinant protein may be useful in obtaining pure heparanase, which in turn may be useful in eliciting
anti-heparanase antibodies, either poly or monoclonal antibodies, and as a screening active ingredient in an anti-heparanase
inhibitors or drugs screening assay or system.

[0098] The invention is further directed at providing a pharmaceutical composition which includes as an active ingredient
a recombinant protein having heparanase catalytic activity. 

[0099] Formulations for topical administration may include, but are not limited to, lotions, ointments, gels, creams,
suppositories, drops, liquids, sprays and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, 
thickeners and the like may be necessary or desirable. Coated condoms, stents, active pads, and other medical devices 
may also be useful. In fact the scope of the present invention includes any medical equipment such as a medical device 
containing, as an active ingredient, a recombinant protein having heparanase catalytic activity. 

[0100] Compositions for oral administration include powders or granules, suspensions or solutions in water or
non-aqueous media, sachets, capsules or tablets. Thickeners, diluents, flavorings, dispersing aids, emulsifiers or binders 
may be desirable. 

[0101] Formulations for parenteral administration may include, but are not limited to, sterile aqueous solutions which
may also contain buffers, diluents and other suitable additives.

[0102] Dosing is dependent on severity and responsiveness of the condition to be treated, but will normally be one or
more doses per day, with course of treatment lasting from several days to several months or until a cure is effected or
a diminution of disease state is achieved. Persons ordinarily skilled in the art can easily determine optimum dosages,
dosing methodologies and repetition rates.

[0103] Further according to the present invention there is provided a method of identifying a chromosome region
harboring a human heparanase gene in a chromosome spread. The method is executed by implementing the following
steps. In a first step, the chromosome spread (either interphase or metaphase spread) is hybridized
with a tagged polynucleotide probe encoding heparanase. The tag is preferably a fluorescent tag. In a second step,
the chromosome spread is washed, thereby removing excess non-hybridized probe. Finally,
signals associated with the hybridized tagged polynucleotide probe are searched for, wherein detected signals are
indicative of a chromosome region harboring the human heparanase gene. One ordinarily skilled in the art would know
how to use the sequences disclosed herein in suitable labeling reactions and how to use the tagged probes to detect,
using in situ hybridization, a chromosome region harboring a human heparanase gene.

[0104] Reference is now made to the following examples, which together with the above descriptions, illustrate the
invention in a non-limiting fashion.

EXAMPLES

[0105] The following protocols and experimental details are referenced in the Examples that follow:

[0106] Purification and characterization of heparanase from a human hepatoma cell line and human placenta:
A human hepatoma cell line (Sk-hep-1) was chosen as a source for purification of a human tumor-derived heparanase.
Purification was essentially as described in U.S. Pat. No. 5,362,641 to Fuks.

[0107] Briefly, 500 liter, 5x1 0^ ^ cells were grown in suspension and the heparanase enzyme was purified about 240,000
fold by applying the following steps: (i) cation exchange (CM-Sephadex) chromatography performed at pH 6.0, 0.3-1.4
M NaCl gradient; (ii) cation exchange (CM-Sephadex) chromatography performed at pH 7.4 in the presence of 0.1 %
CHAPS, 0.3-1.1 M NaCl gradient; (iii) heparin-Sepharose chromatography performed at pH 7.4 in the presence of 0.1 %
CHAPS, 0.35-1.1 M NaCl gradient; (iv) ConA-Sepharose chromatography performed at pH 6.0 in buffer containing 0.1
% CHAPS and 1 M NaCl, elution with 0.25 M α-methyl mannoside; and (v) HPLC cation exchange (Mono-S) chromatography
performed at pH 7.4 in the presence of 0.1 % CHAPS, 0.25-1 M NaCl gradient.

[0108] Active fractions were pooled, precipitated with TCA and the precipitate subjected to SDS polyacrylamide gel
electrophoresis and/or tryptic digestion and reverse phase HPLC. Tryptic peptides of the purified protein were separated
by reverse phase HPLC (C8 column) and homogeneous peaks were subjected to amino acid sequence analysis.

[0109] The purified enzyme was applied to reverse phase HPLC and subjected to N-terminal amino acid sequencing
using the amino acid sequencer (Applied Biosystems).

[0110] Cells: Cultures of bovine corneal endothelial cells (BCECs) were established from steer eyes as previously
described (19, 38). Stock cultures were maintained in DMEM (1 g glucose/liter) supplemented with 10 % newborn calf
serum and 5 % FCS. bFGF (1 ng/ml) was added every other day during the phase of active cell growth (13, 14).

[0111] Preparation of dishes coated with ECM: BCECs (second to fifth passage) were plated into 4-well plates at
an initial density of 2 x 10^ cells/ml, and cultured in sulfate-free Fisher medium plus 5 % dextran T-40 for 12 days.
Na2[35S]O4 (25 μCi/ml) was added on day 1 and 5 after seeding and the cultures were incubated with the label without
medium change. The subendothelial ECM was exposed by dissolving (5 min., room temperature) the cell layer with PBS
containing 0.5 % Triton X-100 and 20 mM NH4OH, followed by four washes with PBS. The ECM remained intact, free
of cellular debris and firmly attached to the entire area of the tissue culture dish (19, 22).

[0112] To prepare soluble sulfate labeled proteoglycans (peak I material), the ECM was digested with trypsin (25
μg/ml, 6 h, 37 °C), the digest was concentrated by reverse dialysis and the concentrated material was applied onto a
Sepharose 6B gel filtration column. The resulting high molecular weight material (Kav < 0.2, peak I) was collected. More
than 80 % of the labeled material was shown to be composed of heparan sulfate proteoglycans (11, 39).

[0113] Heparanase activity: Cells (1 x 10^6/35-mm dish), cell lysates or conditioned media were incubated on top
of 35S-labeled ECM (18 h, 37 °C) in the presence of 20 mM phosphate buffer (pH 6.2). Cell lysates and conditioned
media were also incubated with sulfate labeled peak I material (10-20 The incubation medium was collected, centrifuged
(18,000 x g, 4 °C, 3 min.), and sulfate labeled material analyzed by gel filtration on a Sepharose CL-6B column
(0.9 x 30 cm). Fractions (0.2 ml) were eluted with PBS at a flow rate of 5 ml/h and counted for radioactivity using Bio-fluor
scintillation fluid. The excluded volume (Vo) was marked by blue dextran and the total included volume (Vt) by phenol
red. The latter was shown to comigrate with free sulfate (7, 11, 23). Degradation fragments of HS side chains were
eluted from Sepharose 6B at 0.5 < Kav < 0.8 (peak II) (7, 11, 23). A nearly intact HSPG released from ECM by trypsin
- and, to a lower extent, during incubation with PBS alone - was eluted next to Vo (Kav < 0.2, peak I). Recoveries of
labeled material applied on the columns ranged from 85 to 95 % in different experiments (11). Each experiment was
performed at least three times and the variation of elution positions (Kav values) did not exceed +/- 15 %.
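
The Kav values quoted above follow from the elution volume of a fraction relative to the excluded and total included volumes. The sketch below uses the conventional gel filtration definition Kav = (Ve - Vo) / (Vt - Vo), which the text does not spell out; the elution volumes in the example are hypothetical and chosen only to illustrate the two peak windows.

```python
# Sketch: assign an eluted fraction to peak I or peak II from its Kav.
# Assumptions: the standard definition Kav = (Ve - Vo) / (Vt - Vo);
# the example volumes (in ml) are invented for illustration.

def kav(ve, vo, vt):
    """Partition coefficient of a fraction eluting at volume ve."""
    if vt <= vo:
        raise ValueError("Vt must exceed Vo")
    return (ve - vo) / (vt - vo)


def classify(k):
    """Map a Kav value onto the peak windows given in the text."""
    if k < 0.2:
        return "peak I"      # nearly intact HSPG, eluting next to Vo
    if 0.5 < k < 0.8:
        return "peak II"     # HS degradation fragments
    return "unassigned"


VO_ML, VT_ML = 2.4, 8.4      # hypothetical blue dextran and phenol red positions
for ve_ml in (3.0, 6.0):     # hypothetical elution volumes
    k = kav(ve_ml, VO_ML, VT_ML)
    print(f"Ve = {ve_ml} ml, Kav = {k:.2f}, {classify(k)}")
```

Because Kav is a ratio, fraction numbers can be used in place of volumes when all fractions have the same volume (0.2 ml in the protocol above).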

[0114] Cloning of hpa cDNA: cDNA clones 257548 and 260138 were obtained from the I.M.A.G.E Consortium (2130
Memorial Parkway SW, Huntsville, AL 35801). The cDNAs were originally cloned in EcoRI and NotI cloning sites in the
plasmid vector pT3T7D-Pac. Although these clones are reported to be somewhat different, DNA sequencing demonstrated
that these clones are identical to one another. Marathon RACE™ (rapid amplification of cDNA ends) human
placenta (poly-A) cDNA composite was a gift of Prof. Yossi Shiloh of Tel Aviv University. This composite is vector free,
as it includes reverse transcribed cDNA fragments to which double, partially single stranded adapters are attached on
both sides. The construction of the specific composite employed is described in reference 39a.

[0115] Amplification of the hp3 PCR fragment was performed according to the protocol provided by Clontech Laboratories.
The template used for amplification was a sample taken from the above composite. The primers used for amplification
were:

First step: 5'-primer: AP1: 5'-CCATCCTAATACGACTCACTATAGGGC-3', SEQ ID NO:1; 3'-primer: HPL229:
5'-GTAGTGATGCCATGTAACTGAATC-3', SEQ ID NO:2.

[0116] Second step: nested 5'-primer: AP2: 5'-ACTCACTATAGGGCTCGAGCGGC-3', SEQ ID NO:3; nested 3'-primer:
HPL171: 5'-GCATCTTAGCCGTCTTTCTTCG-3', SEQ ID NO:4. The HPL229 and HPL171 primers were selected according
to the sequence of the EST clones. They include nucleotides 933-956 and 876-897 of SEQ ID NO:9, respectively.
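
The stated coordinates can be checked against the primer lengths. This is a minimal sketch using the HPL229 and HPL171 sequences as given for SEQ ID NOs:2 and 4; the data structure and function name are illustrative only, and the sequences of SEQ ID NO:9 itself are not reproduced here.

```python
# Sketch: check that each gene-specific primer is as long as the span of
# SEQ ID NO:9 it is said to cover, and that the nested primer lies upstream
# of the first-step primer.
# Assumption: coordinates are inclusive and 1-based.

PRIMERS = {
    # name: (sequence 5'->3', first nucleotide, last nucleotide)
    "HPL229": ("GTAGTGATGCCATGTAACTGAATC", 933, 956),
    "HPL171": ("GCATCTTAGCCGTCTTTCTTCG", 876, 897),
}


def span_matches_length(seq, first, last):
    """True if the inclusive span first..last has the same length as seq."""
    return last - first + 1 == len(seq)


for name, (seq, first, last) in PRIMERS.items():
    print(name, len(seq), span_matches_length(seq, first, last))

# A nested 3'-primer should anneal inside the product of the first step,
# i.e. upstream of the first-step 3'-primer on the coding strand.
nested_is_internal = PRIMERS["HPL171"][2] < PRIMERS["HPL229"][1]
print("nested primer internal:", nested_is_internal)
```

Both primers match their stated spans in length (24 and 22 nucleotides), and HPL171 ends upstream of the start of HPL229, as expected for a nested reaction.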

[0117] The PCR program was 94 °C - 4 min., followed by 30 cycles of 94 °C - 40 sec, 62 °C - 1 min., 72 °C - 2.5 min.
Amplification was performed with Expand High Fidelity (Boehringer Mannheim). The resulting ca. 900 bp hp3 PCR
product was digested with BfrI and PvuII. Clone 257548 (phpa1) was digested with EcoRI, followed by end filling, and
was then further digested with BfrI. Thereafter the PvuII - BfrI fragment of the hp3 PCR product was cloned into the blunt
end - BfrI end of clone phpa1, which resulted in having the entire cDNA cloned in the pT3T7-pac vector, designated phpa2.

[0118] DNA Sequencing: Sequence determinations were performed with vector specific and gene specific primers,
using an automated DNA sequencer (Applied Biosystems, model 373A). Each nucleotide was read from at least two
independent primers.

[0119] Computer analysis of sequences: Database searches for sequence similarities were performed using the
Blast network service. Sequence analysis and alignment of DNA and protein sequences were done using the DNA
sequence analysis software package developed by the Genetics Computer Group (GCG) at the University of Wisconsin.
[0120] RT-PCR: RNA was prepared using TRI-Reagent (Molecular Research Center Inc.) according to the manufacturer's
instructions. 1.25 µg were taken for the reverse transcription reaction using MuMLV reverse transcriptase (Gibco BRL) and
Oligo (dT)15 primer, SEQ ID NO:5, (Promega). Amplification of the resultant first strand cDNA was performed with Taq
polymerase (Promega). The following primers were used:

HPU-355: 5'-TTCGATCCCAAGAAGGAATCAAC-3', SEQ ID NO:6, nucleotides 372-394 in SEQ ID NO:9 or 11.
HPL-229: 5'-GTAGTGATGCCATGTAACTGAATC-3', SEQ ID NO:7, nucleotides 933-956 in SEQ ID NO:9 or 11.
PCR program: 94 °C - 4 min, followed by 30 cycles of 94 °C - 40 sec, 62 °C - 1 min, 72 °C - 1 min.

[0121] Expression of recombinant heparanase in insect cells: Cells. High Five (TM) and Sf21 insect cell lines
were maintained as monolayer cultures in SF900II-SFM medium (GibcoBRL).

[0122] Recombinant Baculovirus: Recombinant virus containing the hpa gene was constructed using the Bac to
Bac system (GibcoBRL). The transfer vector pFastBac was digested with SalI and NotI and ligated with a 1.7 kb fragment
of phpa2 digested with XhoI and NotI. The resulting plasmid was designated pFasthpa2. An identical plasmid designated
pFasthpa4 was prepared as a duplicate and both independently served for further experimentation. Recombinant
bacmid was generated according to the instructions of the manufacturer with pFasthpa2, pFasthpa4 and with pFastBac.
The latter served as a negative control. Recombinant bacmid DNAs were transfected into Sf21 insect cells. Five days
after transfection, recombinant viruses were harvested and used to infect High Five (TM) cells, 3 x 10^ cells in T-25
flasks. Cells were harvested 2-3 days after infection. 4 x 10^ cells were centrifuged and resuspended in a reaction
buffer containing 20 mM phosphate citrate buffer, 50 mM NaCl. Cells underwent three cycles of freeze and thaw and
lysates were stored at -80 °C. Conditioned medium was stored at 4 °C.

[0123] Partial purification of recombinant heparanase: Partial purification of recombinant heparanase was
performed by heparin-Sepharose column chromatography followed by Superdex 75 column gel filtration. Culture medium
(150 ml) of Sf21 cells infected with pFhpa4 virus was subjected to heparin-Sepharose chromatography. Elution of 1 ml
fractions was performed with a 0.35 - 2 M NaCl gradient in the presence of 0.1 % CHAPS and 1 mM DTT in 10 mM sodium
acetate buffer, pH 5.0. A 25 µl sample of each fraction was tested for heparanase activity. Heparanase activity was
eluted at the range of 0.65 - 1.1 M NaCl (fractions 18-26, Figure 10a). 5 µl of each fraction was subjected to 15 %
SDS-polyacrylamide gel electrophoresis followed by silver nitrate staining. Active fractions eluted from
heparin-Sepharose (Figure 10a) were pooled and concentrated (x 6) on a YM3 cut-off membrane. 0.5 ml of the concentrated
material was applied onto a 30 ml Superdex 75 FPLC column equilibrated with 10 mM sodium acetate buffer, pH 5.0, containing
0.8 M NaCl, 1 mM DTT and 0.1 % CHAPS. Fractions (0.56 ml) were collected at a flow rate of 0.75 ml/min. Aliquots of
each fraction were tested for heparanase activity and were subjected to SDS-polyacrylamide gel electrophoresis followed
by silver nitrate staining (Figure 11b).

EXAMPLE 1

Cloning of the hpa gene 

[0124] A purified fraction of heparanase isolated from human hepatoma cells (SK-hep-1) was subjected to tryptic
digestion and microsequencing. EST (Expressed Sequence Tag) databases were screened for homology to the back
translated DNA sequences corresponding to the obtained peptides. Two EST sequences (accession Nos. N41349 and
N45367) contained a DNA sequence encoding the peptide YGPDVGQPR (SEQ ID NO:8). These two sequences were
derived from clones 257548 and 260138 (I.M.A.G.E. Consortium) prepared from an 8 to 9 weeks placenta cDNA library
(Soares). Both clones, which were found to be identical, contained an insert of 1020 bp which included an open reading
frame (ORF) of 973 bp followed by a 3' untranslated region of 27 bp and a poly A tail. No translation start site (AUG)
was identified at the 5' end of these clones.

[0125] Cloning of the missing 5' end was performed by PCR amplification of DNA from a placenta Marathon RACE
(TM) cDNA composite. A 900 bp fragment (designated hp3), partially overlapping with the identified 3' encoding EST
clones, was obtained.

[0126] The joined cDNA fragment, 1721 bp long (SEQ ID NO:9), contained an open reading frame which encodes,
as shown in Figure 1 and SEQ ID NO:11, a polypeptide of 543 amino acids (SEQ ID NO:10) with a calculated molecular
weight of 61,192 daltons. The 3' end of the partial cDNA inserts contained in clones 257548 and 260138 started at
nucleotide G721 of SEQ ID NO:9 and Figure 1.
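A calculated molecular weight such as the one quoted above is conventionally obtained by summing average amino acid residue masses and adding one water molecule for the free termini. The sketch below illustrates that arithmetic on the short tryptic peptide YGPDVGQPR (SEQ ID NO:8) rather than on the full 543-residue sequence. The table holds standard average residue masses; the function name is an illustrative assumption and this is not the software actually used for the figure in the text.

```python
# Standard average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once per chain for the N-terminal H and C-terminal OH


def molecular_weight(peptide: str) -> float:
    """Average molecular weight (Da) of an unmodified peptide in one-letter code."""
    try:
        return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER
    except KeyError as exc:
        raise ValueError(f"unknown residue: {exc.args[0]}") from None


if __name__ == "__main__":
    # Tryptic peptide reported in the text (SEQ ID NO:8).
    print(round(molecular_weight("YGPDVGQPR"), 1))
```

Post-translational modifications such as glycosylation are not covered by this sum, so an apparent mass observed on a gel may differ from the calculated value.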
[0127] As further shown in Figure 1, there was a single sequence discrepancy between the EST clones and the PCR
amplified sequence, which led to an amino acid substitution from Tyr246 in the EST to Phe246 in the amplified cDNA.
The nucleotide sequence of the PCR amplified cDNA fragment was verified from two independent amplification products.
The new gene was designated hpa.

[0128] As stated above, the 3' end of the partial cDNA inserts contained in EST clones 257548 and 260138 started
at nucleotide 721 of hpa (SEQ ID NO:9). The ability of the hpa cDNA to form stable secondary structures, such as stem
and loop structures involving nucleotide stretches in the vicinity of position 721, was investigated using computer modeling.
It was found that stable stem and loop structures are likely to be formed involving nucleotides 698-724 (SEQ ID NO:9).
In addition, a high GC content, up to 70 %, characterizes the 5' end region of the hpa gene, as compared to only about
40 % in the 3' region. These findings may explain the premature termination and therefore lack of 5' ends in the EST clones.
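The GC-content comparison described above can be reproduced with a simple sliding-window calculation. The following is a minimal sketch, assuming an upper-case DNA string as input. The function names, window size and step are illustrative choices and are not parameters stated in the specification.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def gc_windows(seq: str, window: int = 100, step: int = 50):
    """Yield (start, end, gc_fraction) per window, using 1-based inclusive coordinates."""
    last_start = max(len(seq) - window, 0)
    for start in range(0, last_start + 1, step):
        chunk = seq[start:start + window]
        yield start + 1, start + len(chunk), gc_fraction(chunk)


if __name__ == "__main__":
    # Toy input: a GC-rich stretch followed by an AT-rich stretch.
    toy = "GCGCGGCCGC" + "ATATTAATGC"
    for start, end, frac in gc_windows(toy, window=10, step=10):
        print(start, end, round(frac, 2))
```

Applied to a full cDNA sequence, a profile of this kind makes a GC-rich 5' region stand out against a lower-GC 3' region.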
[0129] To examine the ability of the hpa gene product to catalyze degradation of heparan sulfate in an in vitro assay,
the entire open reading frame was expressed in insect cells, using the Baculovirus expression system. Extracts of cells
infected with virus containing the hpa gene demonstrated a high level of heparan sulfate degradation activity, while
cells infected with a similar construct containing no hpa gene had no such activity, nor did non-infected cells. These
results are further demonstrated in the following Examples.

EXAMPLE 2 

Degradation of soluble ECM-derived HSPG 

[0130] Monolayer cultures of High Five (TM) cells were infected (72 h, 28 °C) with recombinant Baculovirus containing
the pFasthpa plasmid or with control virus containing an insert free plasmid. The cells were harvested and lysed in
heparanase reaction buffer by three cycles of freezing and thawing. The cell lysates were then incubated (18 h, 37 °C)
with sulfate labeled, ECM-derived HSPG (peak I), followed by gel filtration analysis (Sepharose 6B) of the reaction
mixture.

[0131] As shown in Figure 2, the substrate alone included almost entirely high molecular weight (Mr) material eluted
next to V0 (peak I, fractions 5-20, Kav < 0.35). A similar elution pattern was obtained when the HSPG substrate was
incubated with lysates of cells that were infected with control virus. In contrast, incubation of the HSPG substrate with
lysates of cells infected with the hpa containing virus resulted in a complete conversion of the high Mr substrate into low
Mr labeled degradation fragments (peak II, fractions 22-35, 0.5 < Kav < 0.75).
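The Kav values quoted for peaks I and II follow the usual gel filtration definition, Kav = (Ve - V0) / (Vt - V0), where Ve is the elution volume of the peak, V0 the void volume and Vt the total column volume. A minimal sketch of that calculation is given below. The numeric volumes in the usage lines are hypothetical and are not taken from the experiments described here.

```python
def kav(ve: float, v0: float, vt: float) -> float:
    """Partition coefficient Kav = (Ve - V0) / (Vt - V0) for a gel filtration peak."""
    if vt <= v0:
        raise ValueError("total column volume must exceed the void volume")
    return (ve - v0) / (vt - v0)


if __name__ == "__main__":
    # Hypothetical column: void volume 10 ml, total volume 40 ml.
    print(kav(ve=13.0, v0=10.0, vt=40.0))  # a peak eluting close to the void volume
    print(kav(ve=28.0, v0=10.0, vt=40.0))  # a peak well retarded by the gel
```

Material excluded from the gel elutes with Kav near 0, while smaller fragments that penetrate the beads elute later, with Kav approaching 1.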

[0132] Fragments eluted in peak II were shown to be degradation products of heparan sulfate, as they were (i) 5- to
6-fold smaller than intact heparan sulfate side chains (Kav approx. 0.33) released from ECM by treatment with either
alkaline borohydride or papain; and (ii) resistant to further digestion with papain or chondroitinase ABC, and susceptible
to deamination by nitrous acid (6, 11).

[0133] Similar results (not shown) were obtained with Sf21 cells. Again, heparanase activity was detected in cells
infected with the hpa containing virus (pFhpa), but not with control virus (pF). This result was obtained with two
independently generated recombinant viruses. Lysates of control, non-infected High Five (TM) cells failed to degrade the
HSPG substrate.

[0134] In subsequent experiments, the labeled HSPG substrate was incubated with medium conditioned by infected
High Five (TM) or Sf21 cells.

[0135] As shown in Figures 3a-b, heparanase activity, reflected by the conversion of the high Mr peak I substrate into
the low Mr peak II which represents HS degradation fragments, was found in the culture medium of cells infected with
the pFhpa2 or pFhpa4 viruses, but not with the control pF1 or pF2 viruses. No heparanase activity was detected in the
culture medium of control non-infected High Five (TM) or Sf21 cells.

[0136] The medium of cells infected with the pFhpa4 virus was passed through a 50 kDa cut-off membrane to obtain
a crude estimate of the molecular weight of the recombinant heparanase enzyme. As demonstrated in Figure 4, all
the enzymatic activity was retained in the upper compartment and there was no activity in the flow through (<50 kDa)
material. This result is consistent with the expected molecular weight of the hpa gene product.

[0137] In order to further characterize the hpa product, the inhibitory effect of heparin, a potent inhibitor of heparanase
mediated HS degradation (40), was examined.

[0138] As demonstrated in Figures 5a-b, conversion of the peak I substrate into peak II HS degradation fragments 
was completely abolished in the presence of heparin. 

[0139] Altogether, these results indicate that the heparanase enzyme is expressed in an active form by insect cells
infected with Baculovirus containing the newly identified human hpa gene.

EXAMPLE 3 

Degradation of HSPG in intact ECM 

[0140] Next, the ability of intact infected insect cells to degrade HS in intact, naturally produced ECM was investigated.
For this purpose, High Five (TM) or Sf21 cells were seeded on metabolically sulfate labeled ECM followed by infection
(48 h, 28 °C) with either the pFhpa4 or control pF2 viruses. The pH of the medium was then adjusted to pH 6.2-6.4 and
the cells further incubated with the labeled ECM for another 48 h at 28 °C or 24 h at 37 °C. Sulfate labeled material
released into the incubation medium was analyzed by gel filtration on Sepharose 6B.

[0141] As shown in Figures 6a-b and 7a-b, incubation of the ECM with cells infected with the control pF2 virus resulted
in a constant release of labeled material that consisted almost entirely (>90%) of high Mr fragments (peak I) eluted with
or next to V0. It was previously shown that a proteolytic activity residing in the ECM itself and/or expressed by cells is
responsible for release of the high Mr material (6). This nearly intact HSPG provides a soluble substrate for subsequent
degradation by heparanase, as also indicated by the relatively large amount of peak I material accumulating when the
heparanase enzyme is inhibited by heparin (6, 7, 12, Figure 9). On the other hand, incubation of the labeled ECM with
cells infected with the pFhpa4 virus resulted in release of 60-70% of the ECM-associated radioactivity in the form of low
Mr sulfate-labeled fragments (peak II, 0.5 < Kav < 0.75), regardless of whether the infected cells were incubated with the
ECM at 28 °C or 37 °C. Control intact non-infected Sf21 or High Five (TM) cells failed to degrade the ECM HS side chains.

[0142] In subsequent experiments, as demonstrated in Figs. 8a-b, High Five (TM) and Sf21 cells were infected (96
h, 28 °C) with pFhpa4 or control pF1 viruses and the culture medium incubated with sulfate-labeled ECM. Low Mr HS
degradation fragments were released from the ECM only upon incubation with medium conditioned by pFhpa4 infected
cells. As shown in Figure 9, production of these fragments was abolished in the presence of heparin. No heparanase
activity was detected in the culture medium of control, non-infected cells. These results indicate that the heparanase
enzyme expressed by cells infected with the pFhpa4 virus is capable of degrading HS when complexed to other
macromolecular constituents (i.e. fibronectin, laminin, collagen) of a naturally produced intact ECM, in a manner similar to
that reported for highly metastatic tumor cells or activated cells of the immune system (6, 7).

EXAMPLE 4 

Purification of recombinant heparanase 

[0143] The recombinant heparanase was partially purified from medium of pFhpa4 infected Sf21 cells by
Heparin-Sepharose chromatography (Figure 10a) followed by gel filtration of the pooled active fractions over an FPLC
Superdex 75 column (Figure 11a). A ~63 kDa protein was observed, whose quantity, as detected by silver stained
SDS-polyacrylamide gel electrophoresis, correlated with heparanase activity in the relevant column fractions (Figures
10b and 11b, respectively). This protein was not detected in the culture medium of cells infected with the control pF1 virus
which was subjected to a similar fractionation on heparin-Sepharose (not shown).

EXAMPLE 5

Expression of the hpa gene in various cell types, organs and tissues 

[0144] Referring now to Figures 12a-e, RT-PCR was applied to evaluate the expression of the hpa gene by various
cell types and tissues. For this purpose, total RNA was reverse transcribed and amplified. The expected 585 bp long
cDNA was clearly demonstrated in human kidney, placenta (8 and 11 weeks) and mole tissues, as well as in freshly
isolated and short-term (1.5-48 h) cultured human placental cytotrophoblastic cells (Figure 12a), all known to express
a high heparanase activity (41). The hpa transcript was also expressed by normal human neutrophils (Figure 12b). In
contrast, there was no detectable expression of the hpa mRNA in embryonic human muscle tissue, thymus, heart and
adrenal (Figure 12b). The hpa gene was expressed by several, but not all, human bladder carcinoma cell lines (Figure
12c), SK hepatoma (SK-hep-1), ovarian carcinoma (OV 1063), breast carcinoma (435, 231), melanoma and
megakaryocytic (DAMI, CHRF) human cell lines (Figures 12d-e).
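The expected 585 bp product size follows directly from the coordinates given for the RT-PCR primers (HPU-355 at nucleotides 372-394 and HPL-229 at nucleotides 933-956 of SEQ ID NO:9). A short sketch of that arithmetic, with an illustrative function name:

```python
def amplicon_length(sense_start: int, antisense_end: int) -> int:
    """Length of a PCR product spanning 1-based inclusive template coordinates."""
    if antisense_end < sense_start:
        raise ValueError("antisense primer must lie downstream of the sense primer")
    return antisense_end - sense_start + 1


if __name__ == "__main__":
    # HPU-355 starts at nucleotide 372; HPL-229 covers up to nucleotide 956.
    print(amplicon_length(372, 956))  # prints 585
```

This applies to amplification from cDNA. Amplification across the same coordinates from genomic DNA would give a longer product wherever introns intervene.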

[0145] The above described expression pattern of the hpa transcript was found to correlate very well
with heparanase activity levels determined in various tissues and cell types (not shown).

EXAMPLE 6 

hpa homologous genes 

[0146] EST databases were screened for sequences homologous to the hpa gene. Three mouse ESTs were identified
(accession No. Aa177901, from mouse spleen, Aa067997 from mouse skin, Aa47943 from mouse embryo) and assembled
into an 824 bp cDNA fragment which contains a partial open reading frame (lacking a 5' end) of 629 bp and a 3' untranslated
region of 195 bp (SEQ ID NO:12). As shown in Figure 13, the coding region is 80% similar to the 3' end of the hpa cDNA
sequence. These ESTs are probably cDNA fragments of the mouse hpa homolog that encodes the mouse heparanase.
[0147] Searching for consensus protein domains revealed an amino terminal homology between the heparanase and
several precursor proteins such as Procollagen Alpha 1 precursor, Tyrosine-protein kinase-RYK, Fibulin-1, Insulin-like
growth factor binding protein and several others. The amino terminus is highly hydrophobic and contains a potential
trans-membrane domain. The homology to known signal peptide sequences suggests that it could function as a signal
peptide for protein localization.

EXAMPLE 7 

Isolation of an extended 5' end of hpa cDNA from human SK-hep1 cell line

[0148] The 5' end of hpa cDNA was isolated from the human SK-hep1 cell line by PCR amplification using the Marathon
RACE (TM) (rapid amplification of cDNA ends) kit (Clontech). Total RNA was prepared from SK-hep1 cells using the
TRI-Reagent (Molecular Research Center Inc.) according to the manufacturer's instructions. Poly A+ RNA was isolated
using the mRNA separator kit (Clontech).

[0149] The Marathon RACE SK-hep1 cDNA composite was constructed according to the manufacturer's
recommendations. The first round of amplification was performed using an adaptor specific primer AP1:
5'-CCATCCTAATACGACTCACTATAGGGC-3', SEQ ID NO:1, and a hpa specific antisense primer hpl-629:
5'-CCCCAGGAGCAGCAGCATCAG-3', SEQ ID NO:17, corresponding to nucleotides 119-99 of SEQ ID NO:9. The resulting
PCR product was subjected to a second round of amplification using an adaptor specific nested primer AP2:
5'-ACTCACTATAGGGCTCGAGCGGC-3', SEQ ID NO:3, and a hpa specific antisense nested primer hpl-666:
5'-AGGCTTCGAGCGCAGCAGCAT-3', SEQ ID NO:18, corresponding to nucleotides 83-63 of SEQ ID NO:9. The PCR program
was as follows: a hot start of 94 °C for 1 minute, followed by 30 cycles of 90 °C - 30 seconds, 68 °C - 4 minutes.
The resulting 300 bp DNA fragment was extracted from an agarose gel and cloned into the vector pGEM-T Easy (Promega).
The resulting recombinant plasmid was designated pHPSK1.

[0150] The nucleotide sequence of the pHPSK1 insert was determined and it was found to contain 62 nucleotides of
the 5' end of the placenta hpa cDNA (SEQ ID NO:9) and an additional 178 nucleotides upstream, the first 178 nucleotides
of SEQ ID NOs:13 and 15.

[0151] A single nucleotide discrepancy was identified between the SK-hep1 cDNA and the placenta cDNA. The "T"
derivative at position 9 of the placenta cDNA (SEQ ID NO:9) is replaced by a "G" derivative at the corresponding position
187 of the SK-hep1 cDNA (SEQ ID NO:13).

[0152] The discrepancy is likely to be due to a mutation at the 5' end of the placenta cDNA clone, as confirmed by
sequence analysis of several additional cDNA clones isolated from placenta, which like the SK-hep1 cDNA contained G
at position 9 of SEQ ID NO:9.

[0153] The 5' extended sequence of the SK-hep1 hpa cDNA was assembled with the sequence of the hpa cDNA
isolated from human placenta (SEQ ID NO:9). The assembled sequence contained an open reading frame which encodes,
as shown in SEQ ID NOs:14 and 15, a polypeptide of 592 amino acids with a calculated molecular weight of 66,407
daltons. The open reading frame is flanked by a 93 bp 5' untranslated region (UTR).

EXAMPLE 8 

Isolation of the upstream genomic region of the hpa gene 

[0154] The upstream region of the hpa gene was isolated using the Genome Walker kit (Clontech) according to the
manufacturer's recommendations. The kit includes five human genomic DNA samples, each digested with a different
restriction endonuclease creating blunt ends: EcoRV, ScaI, DraI, PvuII and SspI.

[0155] The blunt ended DNA fragments are ligated to partially single stranded adaptors. The genomic DNA samples
were subjected to PCR amplification using the adaptor specific primer and a gene specific primer. Amplification was
performed with Expand High Fidelity (Boehringer Mannheim).

[0156] A first round of amplification was performed using the ap1 primer: 5'-GTAATACGACTCACTATAGGGC-3',
SEQ ID NO:19, and the hpa specific antisense primer hpl-666: 5'-AGGCTTCGAGCGCAGCAGCAT-3', SEQ ID NO:18,
corresponding to nucleotides 83-63 of SEQ ID NO:9. The PCR program was as follows: a hot start of 94 °C - 3 minutes,
followed by 36 cycles of 94 °C - 40 seconds, 67 °C - 4 minutes.

[0157] The PCR products of the first amplification were diluted 1:50. One µl of the diluted sample was used as a
template for a second amplification using a nested adaptor specific primer ap2: 5'-ACTATAGGGCACGCGTGGT-3',
SEQ ID NO:20, and a hpa specific antisense primer hpl-690: 5'-CTTGGGCTCACCTGGCTGCTC-3', SEQ ID NO:21,
corresponding to nucleotides 62-42 of SEQ ID NO:9. The resulting amplification products were analyzed using agarose
gel electrophoresis. Five different PCR products were obtained from the five amplification reactions. A DNA fragment
of approximately 750 bp, which was obtained from the SspI digested DNA sample, was gel extracted. The purified fragment
was ligated into the plasmid vector pGEM-T Easy (Promega). The resulting recombinant plasmid was designated
pGHP6905 and the nucleotide sequence of the hpa insert was determined.

[0158] A partial sequence of 594 nucleotides is shown in SEQ ID NO:16. The last nucleotide in SEQ ID NO:16
corresponds to nucleotide 93 in SEQ ID NO:13. The DNA sequence in SEQ ID NO:16 contains the 5' region of the hpa
cDNA and 501 nucleotides of the genomic upstream region which are predicted to contain the promoter region of the
hpa gene.

EXAMPLE 9 

Expression of the 592 amino acids HPA polypeptide in a human 293 cell line

[0159] The 592 amino acids open reading frame (SEQ ID NOs:13 and 15) was constructed by ligation of the 110 bp
corresponding to the 5' end of the SK-hep1 hpa cDNA with the placenta cDNA. More specifically, the Marathon RACE
(TM) PCR amplification product of the placenta hpa DNA was digested with SacI and an approximately 1 kb fragment
was ligated into a SacI-digested pGHP6905 plasmid. The resulting plasmid was digested with EarI and AatII. The EarI
sticky ends were blunted and an approximately 280 bp EarI/blunt-AatII fragment was isolated. This fragment was ligated
with pFasthpa digested with EcoRI, which was blunt ended using Klenow fragment and further digested with AatII. The
resulting plasmid contained a 1827 bp insert which includes an open reading frame of 1776 bp, 31 bp of 3' UTR and 21
bp of 5' UTR. This plasmid was designated pFastLhpa.

[0160] A mammalian expression vector was constructed to drive the expression of the 592 amino acids heparanase
polypeptide in human cells. The hpa cDNA was excised from pFastLhpa with BssHII and NotI. The resulting 1850 bp
BssHII-NotI fragment was ligated to a mammalian expression vector pSI (Promega) digested with MluI and NotI. The
resulting recombinant plasmid, pSIhpaMet2, was transfected into a human 293 embryonic kidney cell line.
[0161] Transient expression of the 592 amino acids heparanase was examined by western blot analysis and the
enzymatic activity was tested using the gel shift assay. Both these procedures are described at length in U.S. Pat.
application No. 09/071,739, filed May 1, 1998, which is incorporated by reference as if fully set forth herein. Cells were
harvested 3 days following transfection. Harvested cells were re-suspended in lysis buffer containing 150 mM NaCl, 50
mM Tris pH 7.5, 1% Triton X-100, 1 mM PMSF and protease inhibitor cocktail (Boehringer Mannheim). 40 µg protein
extract samples were used for separation on SDS-PAGE. Proteins were transferred onto a PVDF Hybond-P membrane
(Amersham). The membrane was incubated with an affinity purified polyclonal anti heparanase antibody, as described
in U.S. Pat. application No. 09/071,739. A major band of approximately 50 kDa was observed in the transfected cells
as well as a minor band of approximately 65 kDa. A similar pattern was observed in extracts of cells transfected with
the pShpa, as demonstrated in U.S. Pat. application No. 09/071,739. These two bands probably represent two forms of
the recombinant heparanase protein produced by the transfected cells. The 65 kDa protein probably represents a
heparanase precursor, while the 50 kDa protein is suggested herein to be the processed or mature form.
[0162] The catalytic activity of the recombinant protein expressed in the pShpaMet2 transfected cells was tested by
gel shift assay. Cell extracts of transfected and of mock transfected cells were incubated overnight with heparin (6 µg
in each reaction) at 37 °C, in the presence of 20 mM phosphate citrate buffer pH 5.4, 1 mM CaCl2, 1 mM DTT and 50
mM NaCl. Reaction mixtures were then separated on a 10 % polyacrylamide gel. The catalytic activity of the recombinant
heparanase was clearly demonstrated by a faster migration of the heparin molecules incubated with the transfected cell
extract as compared to the control. Faster migration indicates the disappearance of high molecular weight heparin
molecules and the generation of low molecular weight degradation products.

EXAMPLE 10 

Chromosomal localization of the hpa gene 

[0163] Chromosomal mapping of the hpa gene was performed utilizing a panel of monochromosomal human/CHO
and human/mouse somatic cell hybrids, obtained from the UK HGMP Resource Center (Cambridge, England).
[0164] 40 ng of each of the somatic cell hybrid DNA samples were subjected to PCR amplification using the hpa
primers: hpu565 5'-AGCTCTGTAGATGTGCTATACAC-3', SEQ ID NO:22, corresponding to nucleotides 564-586 of
SEQ ID NO:9, and an antisense primer hpl171 5'-GCATCTTAGCCGTCTTTCTTCG-3', SEQ ID NO:23, corresponding
to nucleotides 897-876 of SEQ ID NO:9.

[0165] The PCR program was as follows: a hot start of 94 °C - 3 minutes, followed by 7 cycles of 94 °C - 45 seconds,
66 °C - 1 minute, 68 °C - 5 minutes, followed by 30 cycles of 94 °C - 45 seconds, 62 °C - 1 minute, 68 °C - 5 minutes,
and a 10 minute final extension at 72 °C.
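The two-stage cycling program above can be written down as data, which makes the total programmed hold time easy to compute. The following is a minimal sketch. The data layout and function name are illustrative, and ramp times between temperatures are ignored because the specification does not state them.

```python
# Each stage: (number of cycles, [(temperature in deg C, hold time in seconds), ...])
PROGRAM = [
    (1, [(94, 180)]),                        # hot start, 3 minutes
    (7, [(94, 45), (66, 60), (68, 300)]),    # first 7 cycles, annealing at 66 deg C
    (30, [(94, 45), (62, 60), (68, 300)]),   # next 30 cycles, annealing at 62 deg C
    (1, [(72, 600)]),                        # final extension, 10 minutes
]


def total_hold_seconds(program) -> int:
    """Sum of all programmed hold times, excluding temperature ramps."""
    return sum(cycles * sum(hold for _, hold in steps) for cycles, steps in program)


if __name__ == "__main__":
    seconds = total_hold_seconds(PROGRAM)
    print(seconds, "s =", round(seconds / 60, 1), "min")
```

The long 5-minute extension step at 68 °C is what dominates the run time, which fits a program designed to amplify a genomic product of several kilobases.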

[0166] The reactions were performed with Expand long PCR (Boehringer Mannheim). The resulting amplification
products were analyzed using agarose gel electrophoresis. As demonstrated in Figure 14, a single band of approximately
2.8 kb was obtained from chromosome 4, as well as from the control human genomic DNA. A 2.8 kb amplification
product is expected based on amplification of the genomic hpa clone (data not shown). No amplification products were
obtained either in the control DNA samples of hamster and mouse or in somatic hybrids of other human chromosomes.

[0168] 

(1) GENERAL INFORMATION:

(i) APPLICANT: Iris Pecker, Israel Vlodavsky and Elena Feinstein

(ii) TITLE OF INVENTION: POLYNUCLEOTIDE ENCODING A POLYPEPTIDE HAVING HEPARANASE AC- 
TIVITY AND EXPRESSION OF SAME IN TRANSDUCED CELLS 

(iii) NUMBER OF SEQUENCES: 23 

(iv) CORRESPONDENCE ADDRESS: 

(B) STREET: 2940 Birchtree Lane

(C) CITY: Silver Spring 

(D) STATE: Maryland 

(E) COUNTRY: United States of America

(F) ZIP: 20906 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 1.44 megabyte, 3.5" microdisk

(B) COMPUTER: Twinhead* Slimnote-890TX

(C) OPERATING SYSTEM: MS DOS version 6.2, Windows version 3.11

(D) SOFTWARE: Word for Windows version 2.0 converted to an ASCII file

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/922,170 

(B) FILING DATE: 2 SEP 1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Friedman, Mark M.

(B) REGISTRATION NUMBER: 33,883 

(C) REFERENCE/DOCKET NUMBER: 910/1

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 972-3-5625553 
(B) TELEFAX: 972-3-5625554

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCATCCTAAT ACGACTCACT ATAGGGC 27



(2) INFORMATION FOR SEQ ID NO:2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GTAGTGATGC CATGTAACTG AATC 24



(2) INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

ACTCACTATA GGGCTCGAGC GGC 23



(2) INFORMATION FOR SEQ ID NO:4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GCATCTTAGC CGTCTTTCTT CG 22



(2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:



TTTTTTTTTT TTTTT 15 



(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

TTCGATCCCA AGAAGGAATC AAC 23 



(2) INFORMATION FOR SEQ ID NO:7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

GTAGTGATGC CATGTAACTG AATC 24 



(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Tyr Gly Pro Asp Val Gly Gln Pro Arg

5 9 

(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1721

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 



CTAGAGCTTT CGACTCTCCG CTGCGCGGCA GCTGGCGCGG GGAGCAGCCA GGTGAGCCCA 60
AGATCCTGCT GCGCTCGAAG CCTGCGCTGC CGCCGCCGCT GATGCTGCTC CTCCTGGGGC 120
CGCTGGGTCC CCTCTCCCCT CGCGCCCTGC CCCGACCTGC GCAAGCACAG GACCTCGTGG 180
ACCTGGACTT CTTCACCCAG GAGCCGCTCC ACCTGGTGAG CCCCTCGTTC CTGTCCGTCA 240
CCATTGACCC CAACCTGGCC ACGGACCCGC GGTTCCTCAT CCTCCTGGGT TCTCCAAAGC 300
TTCGTACCTT GGCCAGAGGC TTGTCTCCTG CGTACCTGAG CTTTGGTGGC ACCAAGACAG 360
ACTTCCTAAT TTTCGATCCC AAGAAGGAAT CAACCTTTGA AGAGAGAAGT TACTGGCAAT 420
CTCAAGTCAA CCACGATATT TCCAAATATC GATCCATCCC TCCTGATGTC GAGGAGAAGT 480
TACCGTTGGA ATCCCCCTAC CAGGAGCAAT TGCTACTCCG AGAACACTAC CAGAAAAAGT 540
TCAAGAACAG CACCTACTCA AGAAGCTCTG TAGATGTGCT ATACACTTTT CCAAACTGCT 600
CAGGACTCGA CTTGATCTTT GGCCTAAATG CGTTATTAAG AACAGCAGAT TTGCAGTGGA 660
ACAGTTCTAA TCCTCACTTG CTCCTGGACT ACTGCTCTTC CAAGGGGTAT AACATTTCTT 720
GGGAACTAGC CAATGAACCT AACAGTTTCC TTAAGAAGGC TCATATTTTC ATCAATGGCT 780
CGCAGTTAGG AGAAGATTAT ATTCAATTGC ATAAACTTCT AAGAAAGTCC ACCTTCAAAA 840
ATCCAAAACT CTATGGTCCT GATGTTGGTC AGCCTCGAAG AAAGACGGCT AAGATGCTGA 900
AGAGCTTCCT CAAGGCTCGT GGAGAAGTCA TTGATTCAGT TACATCGCAT CACTACTATT 960
TGAATGGACG GACTGCTACC AGGGAAGATT TTCTAAACCC TCATGTATTG GACATTTTTA 1020
TTTCATCTGT GCAAAAAGTT TTCCAGGTGG TTGAGAGCAC CAGGCCTGGC AAGAAGCTCT 1080
GGTTAGGAGA AACAAGCTCT GCATATGGAG CCGGAGCCCC CTTGCTATCC GACACCTTTG 1140
CAGCTCGCTT TATGTGGCTG GATAAATTGG CCCTGTCAGC CCGAATGGGA ATAGAAGTGG 1200
TGATGAGCCA AGTATTCTTT GGACCAGGAA ACTACCATTT AGTGCATGAA AACTTCGATC 1260
CTTTACCTGA TTATTGCCTA TCTCTTCTGT TCAAGAAATT GGTGGCCACC AAGGTGTTAA 1320
TGGCAAGCGT GCAAGGTTCA AAGAGAAGGA ACCTTCCAGT ATACCTTCAT TCCACAAACA 1380
CTGACAATCC AAGGTATAAA GAAGGAGATT TAACTCTCTA TGCCATAAAC CTCCATAACG 1440
TCACCAACTA CTTCCCGTTA CCCTATCCTT TTTCTAACAA GCAAGTGGAT AAATACCTTC 1500
TAAGACCTTT GCGACCTCAT CGATTACTTT CCAAATCTGT CCAACTCAAT GGTCTAACTC 1560
TAAAGATGGT CGATGATCAA ACCTTCCCAC CTTTAATCGA AAAACCTCTC CGCCCAGGAA 1620
CTTCACTGGC CTTGCCACCT TTCTCATATA GTTTTTTTGT GATAACAAAT GCCAAACTTC 1680
CTGCTTGCAT CTCAAAATAA AATATACTAG TCCTGACACT G 1721



(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 543

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro Pro Leu Met Leu Leu
5 10 15

Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro Gly Ala Leu Pro Arg Pro
20 25 30

Ala Gln Ala Gln Asp Val Val Asp Leu Asp Phe Phe Thr Gln Glu Pro
35 40 45

Leu His Leu Val Ser Pro Ser Phe Leu Ser Val Thr Ile Asp Ala Asn
50 55 60

Leu Ala Thr Asp Pro Arg Phe Leu Ile Leu Leu Gly Ser Pro Lys Leu
65 70 75 80

Arg Thr Leu Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly
85 90 95

Thr Lys Thr Asp Phe Leu Ile Phe Asp Pro Lys Lys Glu Ser Thr Phe
100 105 110

Glu Glu Arg Ser Tyr Trp Gln Ser Gln Val Asn Gln Asp Ile Cys Lys
115 120 125

Tyr Gly Ser Ile Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp
130 135 140

Pro Tyr Gln Glu Gln Leu Leu Leu Arg Glu His Tyr Gln Lys Lys Phe
145 150 155 160

Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe
165 170 175

Ala Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu Asn Ala Leu Leu
180 185 190

Arg Thr Ala Asp Leu Gln Trp Asn Ser Ser Asn Ala Gln Leu Leu Leu
195 200 205

Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Glu Leu Gly Asn
210 215 220

Glu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe Ile Asn Gly Ser
225 230 235 240

Gln Leu Gly Glu Asp Tyr Ile Gln Leu His Lys Leu Leu Arg Lys Ser
245 250 255

Thr Phe Lys Asn Ala Lys Leu Tyr Gly Pro Asp Val Gly Gln Pro Arg
260 265 270

Arg Lys Thr Ala Lys Met Leu Lys Ser Phe Leu Lys Ala Gly Gly Glu
275 280 285

Val Ile Asp Ser Val Thr Trp His His Tyr Tyr Leu Asn Gly Arg Thr
290 295 300

Ala Thr Arg Glu Asp Phe Leu Asn Pro Asp Val Leu Asp Ile Phe Ile
305 310 315 320

Ser Ser Val Gln Lys Val Phe Gln Val Val Glu Ser Thr Arg Pro Gly
325 330 335

Lys Lys Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala
340 345 350

Pro Leu Leu Ser Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys
355 360 365

Leu Gly Leu Ser Ala Arg Met Gly Ile Glu Val Val Met Arg Gln Val
370 375 380

Phe Phe Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe Asp Pro
385 390 395 400

Leu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu Val Gly Thr
405 410 415

Lys Val Leu Met Ala Ser Val Gln Gly Ser Lys Arg Arg Lys Leu Arg
420 425 430

Val Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr Lys Glu Gly
435 440 445

Asp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr Lys Tyr Leu
450 455 460

Arg Leu Pro Tyr Pro Phe Ser Asn Lys Gln Val Asp Lys Tyr Leu Leu
465 470 475 480

Arg Pro Leu Gly Pro His Gly Leu Leu Ser Lys Ser Val Gln Leu Asn
485 490 495

Gly Leu Thr Leu Lys Met Val Asp Asp Gln Thr Leu Pro Pro Leu Met
500 505 510

Glu Lys Pro Leu Arg Pro Gly Ser Ser Leu Gly Leu Pro Ala Phe Ser
515 520 525

Tyr Ser Phe Phe Val Ile Arg Asn Ala Lys Val Ala Ala Cys Ile
530 535 540 543



(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1718

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11

CT AGA CCT TTC GAC TCT CCG CTG CGC GGC AGC TGG CGC GGG GAC CAG CCA GGT GAG CCC AAC 62

ATG CTG CTG CGC TCG AAG CCT GCG CTG CCG CCG CCC CTG ATC CTG CTG 110
Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro Pro Leu Met Leu Leu
5 10 15

CTC CTG GGG CCG CTG GGT CCC CTC TCC CCT GGC CCC CTG CCC CGA CCT 158
Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro Gly Ala Leu Pro Arg Pro
20 25 30

GCG CAA GCA CAG GAC GTC GTG GAC CTG GAC TTC TTC ACC CAG GAG CCG 206
Ala Gln Ala Gln Asp Val Val Asp Leu Asp Phe Phe Thr Gln Glu Pro
35 40 45

CTC CAC CTC GTG AGC CCC TCG TTC CTG TCC GTC ACC ATT GAC GCC AAC 254
Leu His Leu Val Ser Pro Ser Phe Leu Ser Val Thr Ile Asp Ala Asn
50 55 60

CTG GCC ACG GAC CCG CGG TTC CTC ATC CTC CTG GGT TCT CCA AAG CTT 302
Leu Ala Thr Asp Pro Arg Phe Leu Ile Leu Leu Gly Ser Pro Lys Leu
65 70 75 80

CGT ACC TTC GCC AGA GGC TTG TCT CCT GCG TAC CTG AGC TTT GGT GGC 350
Arg Thr Leu Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly
85 90 95

ACC AAG ACA GAC TTC CTA ATT TTC CAT CCC AAG AAG GAA TCA ACC TTT 398
Thr Lys Thr Asp Phe Leu Ile Phe Asp Pro Lys Lys Glu Ser Thr Phe
100 105 110

GAA GAG AGA ACT TAC TGG CAA TCT CAA CTC AAC CAG CAT ATT TGC AAA 446
Glu Glu Arg Ser Tyr Trp Gln Ser Gln Val Asn Gln Asp Ile Cys Lys
115 120 125

TAT CGA TCC ATC CCT CCT GAT CTG GAG GAG AAC TTA CGG TTG GAA TGG 494
Tyr Gly Ser Ile Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp
130 135 140

CCC TAG CAC GAG CAA TTG CTA CTC CCA GAA CAC TAC CAG AAA AAC TTC 542
Pro Tyr Gln Glu Gln Leu Leu Leu Arg Glu His Tyr Gln Lys Lys Phe
145 150 155 160

AAG AAC AGC ACC TAC TCA AGA ACC TCT GTA CAT CTC CTA TAC ACT TTT 590
Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe
165 170 175

GCA AAC TGC TCA CGA CTC CAC TTG ATC TTT GGC CTA AAT GCG TTA TTA 638
Ala Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu Asn Ala Leu Leu
180 185 190

AGA ACA CCA CAT TTG CAG TGG AAC ACT TCT AAT CCT CAC TTG CTC CTC 686
Arg Thr Ala Asp Leu Gln Trp Asn Ser Ser Asn Ala Gln Leu Leu Leu
195 200 205

CAC TAC TGC TCT TCC AAG GGC TAT AAC ATT TCT TGG CAA CTA CCC AAT 734
Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Glu Leu Gly Asn
210 215 220

GAA CCT AAC ACT TTC CTT AAG AAC CCT CAT ATT TTC ATC AAT CGG TCC 782
Glu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe Ile Asn Gly Ser
225 230 235 240

CAC TTA GCA CAA CAT TAT ATT CAA TTG CAT AAA CTT CTA AGA AAG TCC 830
Gln Leu Gly Glu Asp Tyr Ile Gln Leu His Lys Leu Leu Arg Lys Ser
245 250 255

ACC TTC AAA AAT CCA AAA CTC TAT CCT CCT GAT CTT GGT CAG CCT CGA 878
Thr Phe Lys Asn Ala Lys Leu Tyr Gly Pro Asp Val Gly Gln Pro Arg
260 265 270

AGA AAG ACC CCT AAG ATC CTG AAG AGC TTC CTG AAG CCT GGT CGA GAA 926
Arg Lys Thr Ala Lys Met Leu Lys Ser Phe Leu Lys Ala Gly Gly Glu
275 280 285

CTG ATT GAT TCA CTT ACA TGG CAT CAC TAC TAT TTC AAT CGA CCC ACT 974
Val Ile Asp Ser Val Thr Trp His His Tyr Tyr Leu Asn Gly Arg Thr
290 295 300

CCT ACC AGC GAA GAT TTT CTA AAC CCT CAT GTA TTG CAC ATT TTT ATT 1019
Ala Thr Arg Glu Asp Phe Leu Asn Pro Asp Val Leu Asp Ile Phe Ile
305 310 315 320

TCA TCT GTG CAA AAA GTT TTC CAG GTG CTT CAG AGC ACC AGG CCT GGC 1067
Ser Ser Val Gln Lys Val Phe Gln Val Val Glu Ser Thr Arg Pro Gly
325 330 335

AAG AAG GTC TGG TTA GCA GAA ACA AGC TCT CCA TAT CGA GGC CGA CCC 1115
Lys Lys Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala
340 345 350

CCC TTG CTA TCC GAC ACC TTT GCA CCT CCC TTT ATC TGG CTG GAT AAA 1163
Pro Leu Leu Ser Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys
355 360 365

TTG GGC CTG TCA GCC CGA ATG CGA ATA CAA GTC GTG ATC AGC CAA GTA 1211
Leu Gly Leu Ser Ala Arg Met Gly Ile Glu Val Val Met Arg Gln Val
370 375 380

TTC TTT CGA CCA GCA AAC TAC CAT TTA GTG GAT GAA AAC TTC GAT CCT 1259
Phe Phe Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe Asp Pro
385 390 395 400

TTA CCT GAT TAT TGG CTA TCT CTT CTG TTC AAG AAA TTC GTG GGC ACC 1307
Leu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu Val Gly Thr
405 410 415

AAG CTC TTA ATC GCA ACC GTG CAA CCT TCA AAG ACA ACC AAC CTT CCA 1355
Lys Val Leu Met Ala Ser Val Gln Gly Ser Lys Arg Arg Lys Leu Arg
420 425 430

GTA TAG CTT CAT TGC ACA AAC ACT CAC AAT CCA AGG TAT AAA GAA GGA 1403
Val Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr Lys Glu Gly
435 440 445

CAT TTA ACT CTC TAT CCC ATA AAC CTC CAT AAC GTC ACC AAG TAC TTG 1451
Asp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr Lys Tyr Leu
450 455 460

CGG TTA CCC TAT CCT TTT TCT AAC AAG CAA GTC CAT AAA TAC CTT CTA 1499
Arg Leu Pro Tyr Pro Phe Ser Asn Lys Gln Val Asp Lys Tyr Leu Leu
465 470 475 480

AGA CCT TTG GGA CCT CAT GGA TTA CTT TCC AAA TCT GTC CAA CTC AAT 1547
Arg Pro Leu Gly Pro His Gly Leu Leu Ser Lys Ser Val Gln Leu Asn
485 490 495

GGT CTA ACT CTA AAC ATC GTC GAT GAT CAA ACC TTG CCA CCT TTA ATG 1595
Gly Leu Thr Leu Lys Met Val Asp Asp Gln Thr Leu Pro Pro Leu Met
500 505 510

GAA AAA CCT CTC CGG CCA GGA ACT TCA CTG GGC TTG CCA GCT TTC TCA 1643
Glu Lys Pro Leu Arg Pro Gly Ser Ser Leu Gly Leu Pro Ala Phe Ser
515 520 525

TAT ACT TTT TTT GTG ATA ACA AAT GCC AAA GTT GCT GCT TGC ATC TGA 1691
Tyr Ser Phe Phe Val Ile Arg Asn Ala Lys Val Ala Ala Cys Ile
530 535 540 543

AAA TAA AAT ATA CTA GTC CTG ACA CTG 1718



(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 824

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12

CTGGCAAGAA GGTCTGGTTG GGAGACACGA GCTCAGCTTA CGCTGGCGGT GCACCCTTGC 60
TGTCCAACAC CTTTGCAGCT CCCTTTATGT GGCTGGATAA ATTGGCCCTG TCAGCCCAGA 120
TGGGCATAGA AGTCGTGATG AGGCAGGTGT TCTTCCGAGC AGGCAACTAC CACTTAGTGG 180
ATGAAAACTT TCAGCCTTTA CCTGATTACT GGCTCTCTCT TCTGTTCAAG AAACTGGTAG 240
GTCCCAGGGT GTTACTGTCA AGAGTGAAAG GCCCAGACAG GAGCAAACTC CGAGTGTATC 300
TCCACTGCAC TAACCTCTAT CACCCACGAT ATCAGGAAGG AGATCTAACT CTGTATGTCC 360
TGAACCTCCA TAATGTCACC AAGCACTTGA AGGTACCCCC TCCGTTGTTC ACGAAACCAG 420
TGGATACGTA CCTTCTGAAG CCTTCGGGGC CGGATCGATT ACTTTCCAAA TCTGTCCAAC 480
TGAACGGTCA AATTCTGAAG ATGGTGGATG AGCAGACCCT GCCAGCTTTG ACAGAAAAAC 540
CTCTCCCCGC AGGAAGTGCA CTAAGCCTGC CTGCCTTTTC CTATGGTTTT TTTGTCATAA 600
GAAATGCCAA AATCGCTGCT TGTATATGAA AATAAAAGGC ATACGGTACC CCTGAGACAA 660
AAGCCGACGG GGGTCTTATT CATAAAACAA AACCCTAGTT TAGCAGGCCA CCTCCTTGCC 720
GAGTTCCAGA GCTTCGGGAG GGTGGGGTAC ACTTCAGTAT TACATTCAGT GTCGTGTTCT 780
CTCTAAGAAG AATACTGCAG GTGGTGACAG TTAATAGCAC TGTG 824



(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1899

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13

GGGAAAGCCA GCAAGGAAGT AGGAGAGAGC CGGGCAGGCG GGCCGGGGTT GGATTGGGAG 60
CAGTGGGAGG GATGCAGAAG AGGAGTGGGA GGGATGGAGG GCCCAGTGGG AGGGGTGAGG 120
AGGCGTAACG GCCCGGAGGA AAGGAGAAAA GGGCGCTGGG GCTCGGCGGG AGGAAGTGCT 180
AGAGCTCTCG ACTCTCCGCT CCGCGCCAGC TGCCGGGGGG AGCAGCCAGG TGAGCCCAAG 240
ATGCTGCTGC GCTCGAAGCC TGCGCTGCCC CCGCCGCTGA TGCTGCTGCT CCTGGGCCCG 300
CTGGGTCCCC TCTCCCCTGC CGCCCTGCCC CGACCTGCGC AAGCACAGGA CCTCGTCGAC 360
CTCCACTTCT TCACCCAGGA GCCCCTGCAC CTGGTGAGCC CCTCCTTCCT GTCCGTCACC 420
ATTGACGCCA ACCTCGCCAC CGACCCGCGG TTCCTCATCC TCCTGGGTTC TCCAAAGCTT 480
CGTACCTTCG CCAGAGGCTT GTCTCCTGCC TACCTGACGT TTGGTGGCAC CAAGACACAC 540
TTCCTAATTT TCGATCCCAA GAAGGAATCA ACCTTTCAAG AGAGAAGTTA CTCGCAATCT 600
CAAGTCAACC AGCATATTTC CAAATATCGA TCCATCCCTC CTGATGTGGA CGACAAGTTA 660
CGCTTGGAAT CGCCCTACCA CCAGCAATTC CTACTCCGAC AACACTACCA GAAAAAGTTC 720
AACAACAGCA CCTACTCAAG AACCTCTCTA GATGTGCTAT ACACTTTTGC AAACTCCTCA 780
GGACTGGACT TGATCTTTGG CCTAAATGCG TTATTAAGAA CAGCAGATTT GCAGTGGAAC 840
AGTTCTAATG CTCAGTTGCT CCTGGACTAC TGCTCTTCCA AGGGGTATAA CATTTCTTGG 900
CAACTAGGCA ATGAACCTAA CAGTTTCCTT AAGAAGCCTC ATATTTTCAT CAATGGGTCG 960
CAGTTACGAC AAGATTATAT TCAATTGCAT AAACTTCTAA GAAAGTCCAC CTTCAAAAAT 1020
GCAAAACTCT ATGGTCCTGA TGTTGGTCAG CCTCGAAGAA AGACGGCTAA GATGCTGAAG 1080
AGCTTCCTGA AGGCTGGTGG AGAAGTGATT GATTCAGTTA CATGGCATCA CTACTATTTG 1140
AATGGACGGA CTGCTACCAG GGAAGATTTT CTAAACCCTG ATGTATTGGA CATTTTTATT 1200
TCATCTGTGC AAAAAGTTTT CCAGGTGGTT GAGAGCACCA GGCCTCGCAA GAAGGTCTGG 1260
TTACGAGAAA CAAGCTCTGC ATATGGAGGC GGAGCGCCCT TGCTATCCGA CACCTTTGCA 1320
GCTGGCTTTA TGTGGCTGGA TAAATTCGGC CTGTCAGCCC GAATGGGAAT AGAAGTGGTG 1380
ATGAGGCAAG TATTCTTTGG AGCAGGAAAC TACCATTTAG TGGATGAAAA CTTCGATCCT 1440
TTACCTGATT ATTCGCTATC TCTTCTCTTC AAGAAATTGG TGGGCACCAA GGTGTTAATG 1500
GCAAGCGTCC AACGTTCAAA GAGAAGGAAG CTTCGAGTAT ACCTTCATTC CACAAACACT 1560
GACAATCCAA GGTATAAAGA AGCAGATTTA ACTCTGTATG CCATAAACCT CCATAACGTC 1620
ACCAAGTACT TGCGGTTACC CTATCCTTTT TCTAACAACC AAGTGGATAA ATACCTTCTA 1680
AGACCTTTGG GACCTCATGG ATTACTTTCC AAATCTGTCC AACTCAATGG TCTAACTCTA 1740
AAGATGCTGG ATCATCAAAC CTTGCCACCT TTAATGGAAA AACCTCTCCG GCCAGGAAGT 1800
TCACTGGGCT TGCCAGCTTT CTCATATAGT TTTTTTGTGA TAAGAAATGC CAAAGTTGCT 1860
GCTTCCATCT GAAAATAAAA TATACTAGTC CTGACACTG 1899



(2) INFORMATION FOR SEQ ID NO:14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 592

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14

Met Glu Gly Ala Val Gly Gly Val Arg Arg Arg Asn Gly Ala Glu
5 10 15

Glu Arg Arg Lys Gly Arg Trp Gly Ser Ala Gly Gly Ser Ala Arg
20 25 30

Ala Leu Asp Ser Pro Leu Arg Gly Ser Trp Arg Gly Glu Gln Pro
35 40 45

Gly Glu Pro Lys Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro
50 55 60

Pro Leu Met Leu Leu Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro
65 70 75

Gly Ala Leu Pro Arg Pro Ala Gln Ala Gln Asp Val Val Asp Leu
80 85 90

Asp Phe Phe Thr Gln Glu Pro Leu His Leu Val Ser Pro Ser Phe
95 100 105

Leu Ser Val Thr Ile Asp Ala Asn Leu Ala Thr Asp Pro Arg Phe
110 115 120

Leu Ile Leu Leu Gly Ser Pro Lys Leu Arg Thr Leu Ala Arg Gly
125 130 135

Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly Thr Lys Thr Asp Phe
140 145 150

Leu Ile Phe Asp Pro Lys Lys Glu Ser Thr Phe Glu Glu Arg Ser
155 160 165

Tyr Trp Gln Ser Gln Val Asn Gln Asp Ile Cys Lys Tyr Gly Ser
170 175 180

Ile Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp Pro Tyr
185 190 195

Gln Glu Gln Leu Leu Leu Arg Glu His Tyr Gln Lys Lys Phe Lys
200 205 210

Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe
215 220 225

Ala Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu Asn Ala Leu
230 235 240

Leu Arg Thr Ala Asp Leu Gln Trp Asn Ser Ser Asn Ala Gln Leu
245 250 255

Leu Leu Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Glu
260 265 270

Leu Gly Asn Glu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe
275 280 285

Ile Asn Gly Ser Gln Leu Gly Glu Asp Tyr Ile Gln Leu His Lys
290 295 300

Leu Leu Arg Lys Ser Thr Phe Lys Asn Ala Lys Leu Tyr Gly Pro
305 310 315

Asp Val Gly Gln Pro Arg Arg Lys Thr Ala Lys Met Leu Lys Ser
320 325 330

Phe Leu Lys Ala Gly Gly Glu Val Ile Asp Ser Val Thr Trp His
335 340 345

His Tyr Tyr Leu Asn Gly Arg Thr Ala Thr Arg Glu Asp Phe Leu
350 355 360

Asn Pro Asp Val Leu Asp Ile Phe Ile Ser Ser Val Gln Lys Val
365 370 375

Phe Gln Val Val Glu Ser Thr Arg Pro Gly Lys Lys Val Trp Leu
380 385 390

Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala Pro Leu Leu Ser
395 400 405

Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys Leu Gly Leu
410 415 420

Ser Ala Arg Met Gly Ile Glu Val Val Met Arg Gln Val Phe Phe
425 430 435

Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe Asp Pro Leu
440 445 450

Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu Val Gly Thr
455 460 465

Lys Val Leu Met Ala Ser Val Gln Gly Ser Lys Arg Arg Lys Leu
470 475 480

Arg Val Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr Lys
485 490 495

Glu Gly Asp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr
500 505 510

Lys Tyr Leu Arg Leu Pro Tyr Pro Phe Ser Asn Lys Gln Val Asp
515 520 525

Lys Tyr Leu Leu Arg Pro Leu Gly Pro His Gly Leu Leu Ser Lys
530 535 540

Ser Val Gln Leu Asn Gly Leu Thr Leu Lys Met Val Asp Asp Gln
545 550 555

Thr Leu Pro Pro Leu Met Glu Lys Pro Leu Arg Pro Gly Ser Ser
560 565 570

Leu Gly Leu Pro Ala Phe Ser Tyr Ser Phe Phe Val Ile Arg Asn
575 580 585

Ala Lys Val Ala Ala Cys Ile
590 592

















(2) INFORMATION FOR SEQ ID NO:15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1899

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15

GGG 3
AAA GCG AGC AAG GAA GTA GGA GAG AGC CCG GCA CGC GGG GCG GGG 48
TTG GAT TGG GAG CAC TGG GAG GGA TGC AGA AGA GGA GTG GGA GGG 93

ATG GAG GGC GCA GTG GGA GGG GTG AGG AGG CGT AAC GGG GCG GAG 138
Met Glu Gly Ala Val Gly Gly Val Arg Arg Arg Asn Gly Ala Glu
5 10 15

GAA AGG AGA AAA GGG CGC TGG GGC TCG GCG GGA GGA ACT GCT AGA 183
Glu Arg Arg Lys Gly Arg Trp Gly Ser Ala Gly Gly Ser Ala Arg
20 25 30

GCT CTC GAC TCT CCC GTG CGC GGC AGC TGG CGG GGG GAG CAG CCA 228
Ala Leu Asp Ser Pro Leu Arg Gly Ser Trp Arg Gly Glu Gln Pro
35 40 45

GGT GAG CCC AAG ATG CTG CTG CGC TCG AAG CCT GCG CTG CCG CCG 273
Gly Glu Pro Lys Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro
50 55 60

CCC CTG ATG CTG CTG CTC CTG GGG CCG CTG GGT CCC CTC TCC CCT 318
Pro Leu Met Leu Leu Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro
65 70 75

GGC GCC CTG CCC CCA CCT GCG CAA GCA CAG GAC GTC GTG GAC CTG 363
Gly Ala Leu Pro Arg Pro Ala Gln Ala Gln Asp Val Val Asp Leu
80 85 90

GAC TTC TTC ACC CAG GAG CCG CTC CAC CTG CTC AGC CCC TCG TTC 408
Asp Phe Phe Thr Gln Glu Pro Leu His Leu Val Ser Pro Ser Phe
95 100 105

CTC TCC CTC ACC ATT GAC CCC AAC CTG CCC ACC CAC CCG CGC TTC 453
Leu Ser Val Thr Ile Asp Ala Asn Leu Ala Thr Asp Pro Arg Phe
110 115 120

CTC ATC CTC CTC GCT TCT CCA AAG CTT CGT ACC TTG CCC AGA CGC 498
Leu Ile Leu Leu Gly Ser Pro Lys Leu Arg Thr Leu Ala Arg Gly
125 130 135

TTC TCT CGT GCG TAC CTG ACC TTT GGT GCC ACC AAG ACA GAC TTC 543
Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly Thr Lys Thr Asp Phe
140 145 150

CTA ATT TTC GAT CCC AAG AAG GAA TCA ACC TTT GAA GAG AGA AGT 588
Leu Ile Phe Asp Pro Lys Lys Glu Ser Thr Phe Glu Glu Arg Ser
155 160 165

TAC TGG CAA TCT CAA GTC AAC CAG GAT ATT TCC AAA TAT GGA TCC 633
Tyr Trp Gln Ser Gln Val Asn Gln Asp Ile Cys Lys Tyr Gly Ser
170 175 180

ATC CCT CCT GAT CTG GAG GAG AAG TTA CGG TTG GAA TGG CCC TAC 678
Ile Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp Pro Tyr
185 190 195

CAG GAG CAA TTC CTA CTC CGA GAA CAC TAC CAG AAA AAC TTC AAG 723
Gln Glu Gln Leu Leu Leu Arg Glu His Tyr Gln Lys Lys Phe Lys
200 205 210

AAC ACC ACC TAC TCA AGA ACC TCT CTA GAT GTG CTA TAC ACT TTT 768
Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe
215 220 225

GCA AAC TGC TCA CGA CTG GAC TTG ATC TTT GGC CTA AAT GCG TTA 813
Ala Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu Asn Ala Leu
230 235 240

TTA ACA ACA CCA CAT TTG CAG TGG AAC ACT TCT AAT GCT CAG TTC 858
Leu Arg Thr Ala Asp Leu Gln Trp Asn Ser Ser Asn Ala Gln Leu
245 250 255

CTC CTC GAC TAC TGC TCT TCC AAG CGG TAT AAC ATT TCT TGC GAA 903
Leu Leu Asp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Glu
260 265 270

CTA CGC AAT GAA CCT AAC ACT TTC CTT AAG AAG CCT GAT ATT TTC 948
Leu Gly Asn Glu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe
275 280 285

ATC AAT GGG TCC CAG TTA GGA GAA GAT TAT ATT CAA TTC CAT AAA 993
Ile Asn Gly Ser Gln Leu Gly Glu Asp Tyr Ile Gln Leu His Lys
290 295 300

CTT CTA AGA AAG TCC ACC TTC AAA AAT GCA AAA CTC TAT GCT CCT 1038
Leu Leu Arg Lys Ser Thr Phe Lys Asn Ala Lys Leu Tyr Gly Pro
305 310 315

GAT CTT GGT CAG CCT CCA AGA AAG ACC GCT AAG ATC CTG AAG AGC 1083
Asp Val Gly Gln Pro Arg Arg Lys Thr Ala Lys Met Leu Lys Ser
320 325 330

TTC CTG AAG CCT CGT GCA GAA CTC ATT GAT TCA CTT ACA TGG CAT 1128
Phe Leu Lys Ala Gly Gly Glu Val Ile Asp Ser Val Thr Trp His
335 340 345

CAC TAC TAT TTG AAT CGA CGG ACT GCT ACC AGG GAA GAT TTT CTA 1173
His Tyr Tyr Leu Asn Gly Arg Thr Ala Thr Arg Glu Asp Phe Leu
350 355 360

AAC CCT CAT CTA TTC CAC ATT TTT ATT TCA TCT GTG CAA AAA CTT 1218
Asn Pro Asp Val Leu Asp Ile Phe Ile Ser Ser Val Gln Lys Val
365 370 375

TTC CAC CTC CTT CAG ACC ACC ACC CCT GGC AAC AAC CTC TGC TTA 1263
Phe Gln Val Val Glu Ser Thr Arg Pro Gly Lys Lys Val Trp Leu
380 385 390

CGA GAA ACA AGC TCT GCA TAT GGA CGC GGA GCG CCC TTG CTA TCC 1308
Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala Pro Leu Leu Ser
395 400 405

GAC ACC TTT CCA CCT CGC TTT ATC TGG CTG GAT AAA TTG GCC CTC 1353
Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys Leu Gly Leu
410 415 420

TCA GCC CCA ATG GGA ATA GAA GTG GTG ATG AGG CAA GTA TTC TTT 1398
Ser Ala Arg Met Gly Ile Glu Val Val Met Arg Gln Val Phe Phe
425 430 435

CGA GCA CGA AAC TAC CAT TTA GTG GAT GAA AAC TTC GAT CCT TTA 1443
Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe Asp Pro Leu
440 445 450

CCT GAT TAT TGG CTA TCT CTT CTG TTC AAG AAA TTG GTG GCC ACC 1488
Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu Val Gly Thr
455 460 465

AAG GTG TTA ATG GCA AGC GTG CAA GGT TCA AAG AGA AGG AAG CTT 1533
Lys Val Leu Met Ala Ser Val Gln Gly Ser Lys Arg Arg Lys Leu
470 475 480

CGA GTA TAC CTT CAT TGC ACA AAC ACT GAC AAT CCA AGG TAT AAA 1578
Arg Val Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr Lys
485 490 495

GAA GGA GAT TTA ACT CTG TAT GCC ATA AAC CTC CAT AAC CTC ACC 1623
Glu Gly Asp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr
500 505 510

AAG TAC TTC CGG TTA CCC TAT CCT TTT TCT AAC AAG CAA GTG GAT 1668
Lys Tyr Leu Arg Leu Pro Tyr Pro Phe Ser Asn Lys Gln Val Asp
515 520 525

AAA TAC CTT CTA AGA CCT TTG GGA CCT CAT GGA TTA CTT TCC AAA 1713
Lys Tyr Leu Leu Arg Pro Leu Gly Pro His Gly Leu Leu Ser Lys
530 535 540

TCT CTC CAA CTC AAT GGT CTA ACT CTA AAG ATG GTG GAT GAT CAA 1758
Ser Val Gln Leu Asn Gly Leu Thr Leu Lys Met Val Asp Asp Gln
545 550 555

ACC TTG CCA CCT TTA ATG GAA AAA CCT CTC CGG CCA GGA ACT TCA 1803
Thr Leu Pro Pro Leu Met Glu Lys Pro Leu Arg Pro Gly Ser Ser
560 565 570

CTG GGC TTG CCA GCT TTC TCA TAT ACT TTT TTT GTG ATA AGA AAT 1848
Leu Gly Leu Pro Ala Phe Ser Tyr Ser Phe Phe Val Ile Arg Asn
575 580 585

GCC AAA GTT GCT CCT TGC ATC TCA AAA TAA AAT ATA CTA CTC CTG 1893
Ala Lys Val Ala Ala Cys Ile
590 592

ACA CTG 1899
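
The pairing of codons with residues in SEQ ID NO:15 can be illustrated with a short translation sketch. It assumes the standard genetic code and uses the row of SEQ ID NO:15 that ends at nucleotide 138 as input; `CODON_TABLE` and `translate` are hypothetical names introduced here for illustration only.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(codons: str) -> str:
    """Translate space-separated DNA codons into one-letter amino acid codes."""
    return "".join(CODON_TABLE[codon] for codon in codons.split())


# Row of SEQ ID NO:15 ending at nucleotide 138 (first row carrying a translation).
row_138 = "ATG GAG GGC GCA GTG GGA GGG GTG AGG AGG CGT AAC GGG GCG GAG"
protein_start = translate(row_138)
print(protein_start)
```

The one-letter result corresponds to the residues Met Glu Gly Ala Val Gly Gly Val Arg Arg Arg Asn Gly Ala Glu printed under that row.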



(2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 594

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16

ATTACTATAG GGCACGCCTC CTCCACCCCC CCGCCTGGTA TTCTCTTAAT GAGAAGTTGA 60
TAAAGAATir TGGGTGGTTG ATCTCTTTCC AGCTOCACTT TAGCGTATCC TGAGGCCAGA 120
TTTTTTCAGG CAAAAGTAAA ATACCTGAGA AACTGCCTCC CCAGAGGACA ATCAGATTTT 180
GGCTGGCTCA ACTGACAAGC AAGTGTTTAT AAGCTAGATC GGAGAGGAAG GCATGAATAC 240
TCCATTGGAG GCTTTACTCG AGGGTCAGAG GGATACCCGG CGCCATCAGA ATCGOATCTC 300
GGACTCCGAA ACGCTGGGTT CCCACGAGAG CGCGCAGAAC ACGTGCGTCA GGAAGCCTGG 360
TCCGGGATGC CCAGCCCTGC TCCCCGGGCG CTCCTCCCCG GGCGCTCCTC CCCAGGCCTC 420
CCGCGCGCTT GGATCCCGGC CATCTCCGCA CCCTTCAAGT CGGTGTGCCT GATTTCGTAA 480
GTGAACGTCA CCGCCACCGG GGCGAAAGCG AGCAAGGAAG TAGGAGAGAG CCCGGCAGCC 540
CGCCCCCGGT TGCATTCCCA CCAGTCGGAG GGATGCAGAA GAGGACTCCG AGCG 594

(2) INFORMATION FOR SEQ ID NO:17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17 

CCCCAGGAGC AGCAGCATCA G 21 
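
As a hedged illustration of how a single-stranded primer such as SEQ ID NO:17 relates to the coding strand, the sketch below computes its reverse complement and looks for it in a fragment copied from the SEQ ID NO:15 row ending at nucleotide 318 (spaces removed). The helper name `reverse_complement` is an assumption for illustration, and the orientation of the primer is inferred from this comparison rather than stated at this point in the listing.

```python
# Complement map for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


SEQ_ID_NO_17 = "CCCCAGGAGCAGCAGCATCAG"
rc = reverse_complement(SEQ_ID_NO_17)

# First nine codons of the SEQ ID NO:15 row ending at nucleotide 318.
seq15_fragment = "CCCCTGATGCTGCTGCTCCTGGGGCCG"
print(rc, rc in seq15_fragment)
```

Under these assumptions the reverse complement of the primer lies within the coding-strand fragment, which is consistent with the primer being antisense to that stretch.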



(2) INFORMATION FOR SEQ ID NO:18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18 



AGGCTTCGAG CGCAGCAGCA T 21 

(2) INFORMATION FOR SEQ ID NO:19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19 

GTAATACGAC TCACTATAGG CC 22 



(2) INFORMATION FOR SEQ ID NO:20:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20

ACTATAGGGC ACGCGTGGT 19

(2) INFORMATION FOR SEQ ID NO:21:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21

CTTGGGCTCA CCTCGGTGCT C 21

(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22

AGCTCTGTAG ATCTGCTATA CAC 23

(2) INFORMATION FOR SEQ ID NO:23:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23

GCATCTTAGC CGTCTTTCTT CG 22


Claims 

1. An isolated polynucleotide fragment comprising a polynucleotide sequence encoding a polypeptide having heparanase catalytic activity, wherein said polypeptide shares at least 70% homology with SEQ ID NOs: 10 or 14 or a functional fragment thereof having heparanase catalytic activity.

2. The polynucleotide fragment of claim 1, wherein said polynucleotide sequence includes nucleotides 63 - 1691 of SEQ ID NO: 9 or nucleotides 139 - 1869 of SEQ ID NO: 13.

3. The polynucleotide fragment according to claim 1 or 2, wherein said polynucleotide sequence includes nucleotides 63 - 721 of SEQ ID NO: 9.

4. The polynucleotide fragment of claim 1, wherein said polynucleotide is as set forth in SEQ ID NO: 9 or 13.

5. The polynucleotide fragment of claim 1, wherein said polypeptide includes an amino acid sequence as set forth in SEQ ID NOs: 10 or 14.

6. A polynucleotide fragment comprising a polynucleotide sequence at least 70% homologous with SEQ ID NOs: 9 or 13, said polynucleotide sequence encoding a polypeptide having heparanase catalytic activity.

7. The polynucleotide fragment of any preceding claim, wherein said polynucleotide sequence is selected from the group consisting of double stranded DNA, single stranded DNA and RNA.

8. A vector comprising a polynucleotide fragment according to any preceding claim.

9. A host cell comprising an exogenous polynucleotide fragment according to any of claims 1 to 7.

10. A recombinant protein, which is a polypeptide of 543 amino acids as set forth in SEQ ID NO: 10 with a calculated molecular weight of 61,192 daltons or a functional part thereof.

11. A polypeptide of 592 amino acids as set forth in SEQ ID NO: 14 with a calculated molecular weight of 66,407 daltons or a functional part thereof.

12. A pharmaceutical composition comprising, as an active ingredient, the isolated protein/polypeptide according to claim 10 or 11 and a pharmaceutically acceptable carrier.

13. A medical device containing, as an active ingredient, an isolated protein/polypeptide according to claim 10 or 11.

14. A heparanase overexpression system comprising a cell overexpressing heparanase sharing at least 70% homology with SEQ ID NOs: 10 or 14 and encoded by a polynucleotide fragment as claimed in any one of claims 1 to 7.

15. A method of identifying a chromosome region harbouring a heparanase gene in a chromosome spread comprising the steps of:

(a) hybridizing the chromosome spread with a tagged polynucleotide probe according to claims 6 or 7;

(b) washing the chromosome spread, thereby removing excess of non-hybridized probe; and

(c) searching for signals associated with said hybridized tagged probe, wherein detected signals being indicative of a chromosome region harbouring a heparanase gene.
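
The figures recited in claims 2, 10 and 11 can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that nucleotide ranges are 1-based and inclusive, and `codon_count` is a hypothetical helper, not a term of the claims.

```python
def codon_count(first_nt: int, last_nt: int) -> int:
    """Number of whole codons in an inclusive, 1-based nucleotide range."""
    length = last_nt - first_nt + 1
    if length % 3 != 0:
        raise ValueError(f"range of {length} nt is not a whole number of codons")
    return length // 3


# Claim 2: nucleotides 63 - 1691 of SEQ ID NO: 9.
codons_seq9 = codon_count(63, 1691)

# Claims 10 and 11: calculated molecular weight divided by residue count.
mean_mass_seq10 = 61192 / 543
mean_mass_seq14 = 66407 / 592

print(codons_seq9, round(mean_mass_seq10, 1), round(mean_mass_seq14, 1))
```

Under the stated assumption, the range in SEQ ID NO: 9 spans as many codons as SEQ ID NO: 10 has residues, and both calculated molecular weights correspond to a mean residue mass of roughly 112 daltons.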
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Patentanspruche 

1. Isolated polynucleotide fragment comprising a polynucleotide sequence encoding a polypeptide having heparanase
catalytic activity, wherein the polypeptide has at least 70% homology with SEQ ID NO: 10 or 14 or
a functional fragment thereof having heparanase catalytic activity.

2. Polynucleotide fragment according to claim 1, wherein the polynucleotide sequence comprises nucleotides 63-1691 of SEQ ID NO: 9
or nucleotides 139-1869 of SEQ ID NO: 13.

3. Polynucleotide fragment according to claim 1 or 2, wherein the polynucleotide sequence comprises nucleotides 63-721 of SEQ ID
NO: 9.






4. Polynucleotide fragment according to claim 1, wherein the polynucleotide is as set forth in SEQ ID NO: 9 or 13.

5. Polynucleotide fragment according to claim 1, wherein the polypeptide comprises an amino acid sequence as set forth
in SEQ ID NO: 10 or 14.

6. Polynucleotide fragment comprising a polynucleotide sequence having at least 70% homology with SEQ ID NO: 9
or 13, wherein the polynucleotide sequence encodes a polypeptide having heparanase catalytic activity.

7. Polynucleotide fragment according to any one of the preceding claims, wherein the polynucleotide sequence is selected
from a group consisting of double-stranded DNA, single-stranded DNA and RNA.

8. Vector comprising a polynucleotide fragment according to any one of the preceding claims.

9. Host cell comprising an exogenous polynucleotide fragment according to any one of claims 1 to 7.

10. Recombinant protein which is a polypeptide of 543 amino acids, as set forth in SEQ ID NO: 10, with
a calculated molecular weight of 61,192 daltons, or a functional fragment thereof.

11. Polypeptide of 592 amino acids, as set forth in SEQ ID NO: 14, with a calculated molecular weight of
66,407 daltons, or a functional fragment thereof.

12. Pharmaceutical composition comprising, as an active ingredient, the isolated protein/polypeptide according to claim 10 or 11
and a pharmaceutically acceptable carrier.

13. Medical device containing, as an active ingredient, an isolated protein/polypeptide according to claim 10 or
11.

14. Heparanase overexpression system comprising a cell which overexpresses a heparanase that has at least
70% homology with SEQ ID NO: 10 or 14 and is encoded by a polynucleotide fragment according to any one of
claims 1 to 7.

15. Method for identifying a chromosome region containing a heparanase gene in a chromosome spread,
comprising the steps of:

(a) hybridizing the chromosome spread with a labelled polynucleotide probe according to one of
claims 6 or 7;

(b) washing the chromosome spread, whereby an excess of non-hybridized probe is removed;
and

(c) searching for signals associated with the hybridized labelled probe, wherein the detection of
signals indicates a chromosome region containing a heparanase gene.
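
The nucleotide ranges recited in claims 2 and 3 are given as inclusive, 1-based positions. As a purely illustrative aid, the following Python sketch converts such a range into a length and a codon count and shows how the corresponding slice would be taken from a sequence string. The function names and the toy sequence are assumptions of this sketch; SEQ ID NO: 9 itself is not reproduced here.

```python
from typing import Tuple


def span_summary(start: int, end: int) -> Tuple[int, int, int]:
    """Return (length, whole_codons, leftover_nucleotides) for an
    inclusive, 1-based nucleotide range such as 63-1691."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    length = end - start + 1           # inclusive on both ends
    codons, leftover = divmod(length, 3)
    return length, codons, leftover


def extract(sequence: str, start: int, end: int) -> str:
    """Slice a sequence using inclusive, 1-based coordinates."""
    if start < 1 or end < start or end > len(sequence):
        raise ValueError("range lies outside the sequence")
    return sequence[start - 1:end]     # Python slices are 0-based, end-exclusive


if __name__ == "__main__":
    print(span_summary(63, 1691))      # range recited for SEQ ID NO: 9 in claim 2
    print(span_summary(63, 721))       # range recited in claim 3
    toy = "AAATGCTGCTGTAAGG"           # hypothetical sequence for illustration
    print(extract(toy, 3, 11))
```

For the range 63-1691 the sketch gives 1629 nucleotides, that is 543 whole codons, which coincides with the 543 amino acids recited in claim 10. Whether a recited range includes a stop codon is not stated in the claims and is left open here.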

Claims (French version)

1. Isolated polynucleotide fragment comprising a polynucleotide sequence encoding a polypeptide having
heparanase catalytic activity, wherein said polypeptide has at least 70% homology with SEQ ID
NO: 10 or 14 or a functional fragment thereof having heparanase catalytic activity.


2. Polynucleotide fragment according to claim 1, wherein the polynucleotide sequence includes nucleotides
63-1691 of SEQ ID NO: 9 or nucleotides 139-1869 of SEQ ID NO: 13.

3. Polynucleotide fragment according to claim 1 or 2, wherein the polynucleotide sequence includes
nucleotides 63-721 of SEQ ID NO: 9.

4. Polynucleotide fragment according to claim 1, wherein said polynucleotide is as set forth in
SEQ ID NO: 9 or 13.

5. Polynucleotide fragment according to claim 1, wherein said polypeptide includes an amino acid
sequence as set forth in SEQ ID NO: 10 or 14.

6. Polynucleotide fragment comprising a polynucleotide sequence at least 70% homologous to SEQ ID
NOs: 9 or 13, said polynucleotide sequence encoding a polypeptide having heparanase catalytic activity.

7. Polynucleotide fragment according to any one of the preceding claims, wherein said polynucleotide sequence
is selected from the group consisting of double-stranded DNA, single-stranded DNA and RNA.


8. Vector comprising a polynucleotide fragment according to any one of the preceding claims.

9. Host cell comprising an exogenous polynucleotide fragment according to one of claims 1 to 7.

10. Recombinant protein, which is a polypeptide of 543 amino acids as set forth in SEQ ID NO: 10 having
a calculated molecular weight of 61,192 daltons, or a functional part thereof.

11. Polypeptide of 592 amino acids as set forth in SEQ ID NO: 14 having a calculated molecular weight of
66,407 daltons, or a functional part thereof.


12. Pharmaceutical composition comprising, as an active ingredient, the isolated protein/polypeptide according to
claim 10 or 11 and a pharmaceutically acceptable carrier.

13. Medical device containing, as an active ingredient, an isolated protein/polypeptide according to claim
10 or 11.

14. System for the overexpression of a heparanase, comprising a cell overexpressing a heparanase having
at least 70% homology with SEQ ID NO: 10 or 14 and encoded by a polynucleotide fragment as claimed
in one of claims 1 to 7.


15. Method for identifying a chromosome region carrying a heparanase gene in a chromosome spread,
comprising the steps of:

(a) hybridizing the chromosome spread with a tagged polynucleotide probe according to claims
6 or 7,

(b) washing the chromosome spread, thereby removing the excess of non-hybridized probe; and

(c) searching for signals associated with said tagged probe, the detected signals being indicative of a
chromosome region carrying a heparanase gene.

[Figure: nucleotide sequence numbered from position 1, with the deduced amino acid sequence printed beneath each line; sequence text illegible in this copy.]
FIG. 3A
FIG. 4
FIG. 7A
FIG. 8A
FIG. 9A
FIG. 10A
FIG. 11A
FIG. 12A FIG. 12B
FIG. 13
mouse  CTGGCAAGAAGGTCTGGTTGGGAGAGACGAGCTCAGCTTACGGTGGCGGT  50
human  CTGGCAAGAAGGTCTGGTTAGGAGAAACAAGCTCTGCATATGGAGGCGGA  1115

mouse  GCACCCTTGCTGTCCAACACCTTTGCAGCTGGCTTTATGTGGCTGGATAA  100
human  GCGCCCTTGCTATCCGACACCTTTGCAGCTGGCTTTATGTGGCTGGATAA  1165

mouse  ATTGGGCCTGTCAGCCCAGATGGGCATAGAAGTCGTGATGAGGCAGGTGT  150
human  ATTGGGCCTGTCAGCCCGAATGGGAATAGAAGTGGTGATGAGGCAAGTAT  1215

mouse  TCTTCGGAGCAGGCAACTACCACTTAGTGGATGAAAACTTTGAGCCTTTA  200
human  TCTTTGGAGCAGGAAACTACCATTTAGTGGATGAAAACTTCGATCCTTTA  1265

mouse  CCTGATTACTGGCTCTCTCTTCTGTTCAAGAAACTGGTAGGTCCCAGGGT  250
human  CCTGATTATTGGCTATCTCTTCTGTTCAAGAAATTGGTGGGCACCAAGGT  1315

mouse  GTTACTGTCAAGAGTGAAAGGCCCAGACAGGAGCAAACTCCGAGTGTATC  300
human  GTTAATGGCAAGCGTGCAAGGTTCAAAGAGAAGGAAGCTTCGAGTATACC  1365

mouse  TCCACTGCACTAACGTCTATCACCCACGATATCAGGAAGGAGATCTAACT  350
human  TTCATTGCACAAACACTGACAATCCAAGGTATAAAGAAGGAGATTTAACT  1415

mouse  CTGTATGTCCTGAACCTCCATAATGTCACCAAGCACTTGAAGGTACCGCC  400
human  CTGTATGCCATAAACCTCCATAACGTCACCAAGTACTTGCGGTTACCCTA  1465

mouse  TCCGTTGTTCAGGAAACCAGTGGATACGTACCTTCTGAAGCCTTCGGGGC  450
human  TCCTTTTTCTAACAAGCAAGTGGATAAATACCTTCTAAGACCTTTGGGAC  1515

mouse  CGGATGGATTACTTTCCAAATCTGTCCAACTGAACGGTCAAATTCTGAAG  500
human  CTCATGGATTACTTTCCAAATCTGTCCAACTCAATGGTCTAACTCTAAAG  1565

mouse  ATGGTGGATGAGCAGACCCTGCCAGCTTTGACAGAAAAACGTCTCCCCGC  550
human  ATGGTGGATGATCAAACCTTGCCACCTTTAATGGAAAAACCTCTCCGGCC  1615

mouse  AGGAAGTGCACTAAGCCTGCCTGCCTTTTCCTATGGTTTTTTTGTCATAA  600
human  AGGAAGTTCACTGGGCTTGCCAGCTTTCTCATATAGTTTTTTTGTGATAA  1665

mouse  GAAATGCCAAAATCGCTGCTTGTATATGAAAATAAAA  637
human  GAAATGCCAAAGTTGCTGCTTGCATCTGAAAATAAAA  1702
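
For illustration only, the degree of identity between two aligned rows of such a mouse/human comparison can be counted position by position. The Python sketch below does this for the final 37-nucleotide row of the alignment (mouse positions up to 637, human positions up to 1702). The function name is an assumption of this sketch, the two strings are read from the alignment with the spacing in the mouse row treated as scanning residue rather than alignment gaps, and plain percent identity is used here as a simple stand-in; it is not asserted to be the homology measure relied on in the claims.

```python
from typing import Tuple

# Last row of the mouse/human alignment, as read from the figure (assumed gap-free).
MOUSE_3PRIME = "GAAATGCCAAAATCGCTGCTTGTATATGAAAATAAAA"
HUMAN_3PRIME = "GAAATGCCAAAGTTGCTGCTTGCATCTGAAAATAAAA"


def pairwise_identity(a: str, b: str) -> Tuple[int, float, str]:
    """Compare two equal-length aligned rows.

    Returns (matches, percent_identity, match_line), where match_line has
    '|' under identical positions and a space elsewhere. A '-' gap
    character never counts as a match.
    """
    if not a or len(a) != len(b):
        raise ValueError("aligned rows must be non-empty and of equal length")
    match_line = "".join(
        "|" if x == y and x != "-" else " "
        for x, y in zip(a.upper(), b.upper())
    )
    matches = match_line.count("|")
    return matches, 100.0 * matches / len(a), match_line


if __name__ == "__main__":
    matches, percent, line = pairwise_identity(MOUSE_3PRIME, HUMAN_3PRIME)
    print("mouse ", MOUSE_3PRIME)
    print("      ", line)
    print("human ", HUMAN_3PRIME)
    print(f"{matches} of {len(MOUSE_3PRIME)} positions identical ({percent:.1f}%)")
```

The same function can be applied row by row, or to the concatenated rows, to obtain an overall figure for the aligned region.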